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PREFACE 


This textbook is designed for first and second year undergraduates 
in Physics, Engineering and Mathematics. Generally, only simple 
algebra and calculus are used, at a level the average student should 
understand. The text is developed from the author’s, An Introduction to 
the Theory of Relativity, though the opportunity has been taken to add 
additional material and to change the emphasis, in several places. 
The author has found that many students are interested enough in 
relativity to want to read around the subject. Consequently, the 
present text contains more than is generally given in a first lecture 
course on relativity, so that the text can be used to extend lecture 
courses. 

The present text follows the historical approach, in which special 
relativity is presented as a natural consequence of the extension of 
the principle of relativity to the laws of classical optics and electro¬ 
magnetism. However, most of the historical work on the ether 
theories has either been omitted or relegated to Appendix 1. The 
author has found that, with elementary students, it is better to 
proceed as directly as possible from Newtonian mechanics to the 
theory of special relativity. With such students it is hardly worth 
while cultivating the confusing ether points of view in detail, merely 
to replace them immediately with the relativistic point of view. If 
they wish, they can return to study the ether theories in detail later, 
as part of the history of science, after cultivating the relativistic 
point of view. 

The Lorentz transformations are developed in the conventional 
way in Chapter 3, using the principle of relativity and the principle 
of the constancy of the speed of light. An alternative approach, 
based on the same axioms, but using radar methods to determine the 
positions and times of events, is given in Appendix 6. This is 
Professor H. Bondi’s method of the if-calculus. The author has found 
that students find this method of measuring the positions and times 
of distant events easier to follow than using spatially distributed 
synchronized clocks. It is to be hoped that more and more teachers 
will try the approach of Appendix 6. 

In Chapter 5, the Lorentz transformations are used as a heuristic 
aid to develop the laws of relativistic mechanics. There is now, 
however, independent experimental evidence in favour of relativistic 
mechanics, which can now be developed from experiments, inde¬ 
pendently of the Lorentz transformations. The theory of special 
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relativity can now be presented as a natural consequence of the 
extension of the principle of relativity to these new experimental laws 
of high-speed mechanics. An account of this method is given in 
Appendix 4. The Lorentz transformations can be developed from 
the principle of the constancy of the limiting speed of particles, 
developed in Section 5.4.2 and Appendix 3(b). Some readers may 
find it more convincing to approach special relativity from the 
viewpoint of experiments on high-speed particles, since in this case 
deviations from Newtonian mechanics are large. When approached 
via optics and electromagnetism, most of the experimental evidence 
in favour of special relativity is the absence of some small second 
order effects, which should have been present if the Galilean 
transformations were correct (cf. Appendix 1). The approach to 
special relativity via mechanics gives the reader an alternative 
point of view, based on experiments with particles, rather than 
waves. 

In Chapter 6, the two methods of representing the Lorentz 
transformations geometrically are discussed. Minkowski’s method 
of using real variables is a natural extension of the A-calculus 
methods of Appendix 6. Four-vector methods are used to redevelop 
special relativity in Chapter 6. 

Not all students want a comprehensive discussion of relativistic 
electromagnetism in their first course in relativity, so that only a 
very brief insight into relativistic electromagnetism is given in 
Chapter 7, of the present text. More advanced readers can go on to 
Introduction to the Theory of Relativity or to the author’s forthcoming 
book Electromagnetism via Relativity, an Alternative Approach to Maxwells 
Equations for comprehensive discussions of conventional relativistic 
electromagnetism. The latter book also includes the development 
and interpretation of Maxwell’s equations from Coulomb’s law, 
using the force transformations of special relativity. 

A full account of the Clock Paradox is given in Chapter 8. In 
Chapter 9, a brief account is given of some of the features of the 
theory of general relativity based on the Principle of Equivalence. 
This Chapter illustrates some of the limitations of the special (or 
restricted) theory. A large number of problems is given at the 
end of most chapters. 

The author would like to thank Professor G. K. T. Conn for his 
encouragement throughout the preparation of the manuscript. 


Exeter 


W. G. V. Rosser 
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HISTORICAL INTRODUCTION 
I—NEWTONIAN MECHANICS 

1.1. INTRODUCTION 

The enormous success of Newtonian mechanics in interpreting many 
of the phenomena familiar to them in their daily lives gives students 
the impression that it is infallible and correct in every detail. It is 
surprising how many students are hazy about the fundamentals 
of Newtonian mechanics, even about such things as the definition 
of inertial mass and force. At school level the understanding of 
physics is largely based on familiarity with the equations of physics. 
When problems are set in mechanics, if they can use the appropriate 
equation to obtain the correct solution to the problem, students feel 
that they understand the subject. In this chapter some features of 
Newtonian mechanics will be discussed a little more critically than 
is normally done at school level, in preparation for the changes in 
interpretation necessary when one comes to discuss the theory of 
special relativity. A few features of scientific theories in general will 
be pointed out within the context of Newtonian mechanics. This 
may help to make it a little easier for those readers meeting rela¬ 
tivity for the first time to accept that Newtonian mechanics is not 
infallible, and that, when the velocities of the particles are com¬ 
parable with the velocity of light, then Newtonian mechanics must 
be replaced by an entirely new theory. 

1.2. THE STANDARDS OF LENGTH, TIME AND MASS 

The feature which marked the rise of physical science in the 
sixteenth and seventeenth centuries was that scientists began to 
make quantitative observations and to carry out experiments under 
controlled conditions. In order to make quantitative measurements, 
standards have to be chosen. This book begins with a review of the 
primary standards of the so-called fundamental units of length, time 
and mass. 

The international standard of length is the metre. Until recently 
it was defined as the distance between two defining marks engraved 
on a platinum—iridium bar, when the temperature of the bar was 
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HISTORICAL INTRODUCTION—NEWTONIAN MECHANICS 

that of melting ice. The standard metre was kept at the International 
Bureau of Weights and Measures at Sfevres near Paris. Accurate 
copies of this standard metre were made and these secondary 
standards were then used to calibrate other secondary devices such 
as rulers, which were then used for measuring lengths. The standard 
metre was calibrated in terms of the wavelength of the red cadmium 
line and other spectral lines. For example, using a Fabry and Perot 
interferometer, it was shown by Benoit, Fabry and Perot in 1913 that 
there were 1,553,164-13 wavelengths of the red cadmium line in a 
metre of dry air at 15°C and 760 mm pressure. The wavelength of 
the cadmium line under these conditions was 6438-4696 A. This 
wavelength was used for many years as a secondary standard of 
length and could have been used to calibrate a new standard metre 
to an accuracy of at least one part in a million. At the Eleventh 
General Conference on Weights and Measures meeting in Paris in 
October 1960 it was decided to replace the old definition of the 
metre in terms of the platinum-iridium prototype by the statement 
that the metre is 1,650,763-73 wavelengths of the orange-red line of 
krypton-86. The new metre agrees with the old to within about one 
part in 10 7 . If one were in a rocket, moving with uniform velocity 
relative to the earth, one could still use the new definition of the 
metre as the unit of length, and one could build an optical apparatus 
from materials in the rocket and use it to calibrate secondary 
standards such as rulers, which were at rest relative to the rocket. 

In order to measure time one must choose a repetitive process and 
assume that it recurs at constant time intervals. Any repetitive 
process could be chosen as a primary standard; the one actually 
chosen was the mean solar day, which was defined in terms of the 
average time taken by the earth to make one complete revolution on 
its axis with respect to the sun. On account of the orbital motion of 
the earth around the sun, the solar day varies during the year; 
hence the average value was taken as the standard. Astronomers 
prefer to take the period of rotation of the earth relative to the 
‘fixed’ stars, and they use sidereal time. In the laboratory, clocks 
such as an oscillating pendulum can be calibrated in terms of the 
mean solar day. The frequency of electrical signals can be measured 
very accurately and quartz crystal clocks can be used as accurate 
secondary standards. The frequencies associated with some atomic 
and nuclear processes have also been used as secondary standards 
of frequency, for example in ammonia vapour and caesium clocks. 
There may be secular changes in the angular velocity of the earth’s 
rotation so that the mean solar day may vary from year to year. 
Pending a possible re-definition of the second in terms of an atomic or 
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THE STANDARDS OF LENGTH, TIME AND MASS 

molecular frequency, the Eleventh General Conference on Weights 
and Measures decided to define the second as 1/31,556,925-9747 of 
the tropical year 1900. This fixes the unit of time. If one were in a 
rocket moving relative to the earth one could set up a secondary 
atomic clock and calibrate other secondary clocks at rest in the 
rocket in terms of the frequency of an atomic process, the atomic 
frequency having previously been determined in terms of the solar 
year of 1900 using apparatus at rest on the earth. 

The international primary standard of mass is the kilogram; this 
is defined as the mass of a cylinder of platinum-iridium kept at 
Sevres. The comparison of the gravitational masses of other bodies 
with the standard can be carried out by weighing. The determina¬ 
tion of inertial mass involves the application of the theory of 
mechanics and a discussion of this is deferred until after the con¬ 
sideration of Newton’s laws of motion. In principle, the rest mass 
of the proton could be used as a secondary standard of mass suitable 
for calibrating masses at rest on a rocket moving with uniform 
velocity relative to the earth. 

Other mechanical quantities are expressed in terms of mass, 
length and time. For example, the velocity of a body is defined as 
the distance the body moves in unit time. The velocity of light 
in vacuo can be measured accurately, and it has been suggested 
that one could adopt the velocity of light in vacuo as a primary 
standard, assigning it an arbitrary value. A time interval could 
then be measured in terms of the distance travelled by light in 
vacuo in that interval. Alternatively, one could measure a distance 
in terms of the time light or radio signals take to travel that distance. 
This is the method used in determining distances by radar methods. 
(Gf. Section 3.9 and Appendix 6.) 

Physical measurements on bodies are carried out by comparison 
with the primary standards (or with secondary standards which 
have themselves been calibrated by comparison with the primary 
or other secondary standards). For example, the length of a 
stationary body can be determined in practice by comparing the 
positions of the ends of the body relative to a secondary standard 
such as a ruler, whose length has been subdivided into equal 
intervals. In this way an experimental determination of the length 
of the stationary body is made, and the physicist is satisfied with a 
statement that the measured length of a body is a number between, 
say, 13-3 and 13-4 cm. Generally the physicist does not stop to 
worry about terms such as the correct or absolute length of a body, 
though, of course, he is always striving to improve the accuracy of 
his experimental measurements. 
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HISTORICAL INTRODUCTION—NEWTONIAN MECHANICS 

It is common practice to use symbols to denote the magnitudes 
of physical quantities relative to the primary standards. This 
enables the laws of physics to be written in a mathematical form. 
The mathematical expressions of the laws of mechanics do not 
depend on the actual sizes of the arbitrary units of mass, length and 
time. The sizes of the fundamental units chosen affect only the 
values of the numerical constants in the mathematical equations 
expressing the laws of physics. 

1.3. A CRITIQUE OF NEWTONIAN MECHANICS 

Prior to the time of Galileo, the prevailing views on motion were 
derived largely from Aristotle, and when they were not derived 
from pure reason without recourse to experiments, they were 
generally based on qualitative observations only. In the sixteenth 
and seventeenth centuries under the influence of people such as 
Copernicus, Tycho Brahe, Kepler, Galileo, etc., a new outlook arose 
based on quantitative observations and systematic experimentation 
under controlled conditions. From a limited number of observations 
laws were postulated. For example, from a study of the time taken 
by metal balls to roll down an inclined plane, Galileo postulated the 
law of falling bodies. At this stage in the development of science, 
physical laws were generally derived directly by ‘induction’ from 
a limited number of quantitative observations. The early investi¬ 
gations on mechanics culminated in Newton’s laws of motion and 
Newton’s theory of universal gravitation. A typical statement of 
Newton’s laws of motion is as follows: 

(a) All bodies continue in their state of rest or of uniform motion 
in a straight line unless they are compelled to change that state by 
external forces. 

( b ) The rate of change of momentum is proportional to the 
impressed force and takes place in the direction in which the force 
is acting. 

(c) Action and reaction are equal and opposite. 

If in a reference system a body not under the influence of any 
forces (i.e. a body far removed from all other bodies capable of 
exerting forces), moves in a straight line with constant speed, then 
Newton’s first law is valid in this reference system. Such a reference 
system is called an inertial frame. The properties of inertial frames 
are elaborated in Section 1.6. 

According to Newton’s law of universal gravitation, every particle 
of matter in the universe attracts every other particle with a force 
proportional to the product of the masses of the particles, inversely 
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A CRITIQUE OF NEWTONIAN MECHANICS 

proportional to the square of the distance between them, and 
directed along the line joining the particles. Thus, if the gravita¬ 
tional masses of two ‘point’ particles are m x and m 2 respectively, 
and if r is their distance apart, then 



( 1 . 1 ) 


where G is the gravitational constant. 

The object of a theory is to correlate laws so that a theory is more 
comprehensive than a single law. Newton was familiar with the law 
of falling bodies and with Kepler’s laws of planetary motion. 
Newton suggested that, if it is assumed that every pair of particles 
in the universe attract each other with a force given by eqn (1.1), 
then on the basis of this assumption, plus Newton’s laws of motion, 
the other individual laws can be derived and many other phenom¬ 
ena, such as the motion of the moon around the earth, can be 
interpreted. In addition to interpreting known laws, new theories 
are used to make new predictions, which should subsequently be 
tested by experiment. The degree of acceptance of a new theory 
depends largely on how well these new predictions agree with the 
experimental results. 

In a theory it is postulated that nature behaves in a particular 
way. For this reason a theory is sometimes described as a model of 
nature; it is not necessarily a mechanical model but may be a 
functional relation such as eqn (1.1). A theory should enable one 
to predict the course of an experiment from given initial conditions. 
Mathematical reasoning is used in applying a theory to a particular 
case. The conclusions of the theory obtained in this way should be 
in agreement with the experimental results. If a theory were 
perfectly correct, then there would be a one to one correspondence 
between the predictions of the theory and the course of nature. 
When applying a theory one idealizes the system and considers only 
those quantities which produce effects of the order of magnitude 
of the accuracy required, for example, when calculating the acceler¬ 
ation of a ball near the surface of the earth, in the interests of 
simplicity, one would neglect the gravitational attraction of a 
distant star, though, in principle, it would always be present. 

If a theory is to describe what happens in practice, then the 
quantities appearing in the mathematical expressions of the theory 
must relate directly or indirectly to quantities which can be 
measured in practice. The meaning of the quantities used in the 
statement of Newton’s laws of motion will now be discussed. The 
measurement of length and time relative to arbitrary standards 
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was discussed in Section 1.2. The velocity u and the acceleration a 
of a particle can be defined and measured in terms of length and 
time. Using Newton’s second law, the force f acting on a particle 
can be defined as the rate of change of the momentum of the 
particle, where the momentum of the particle is defined as the 
product of its inertial mass m and its velocity u, that is 

f= ^ (« u ) 0-2) 

Provided the inertial mass of the particle is independent of its 
velocity, eqn (1.2) can be written as 

t = m—’= ma. (1-3) 


where a is the acceleration of the particle. Hence force can be 
measured experimentally in terms of the acceleration the force 
produces on a body of known inertial mass; but before force can 
be measured in this way one must know the inertial mass of the 
body. If equal forces acted on two bodies of inertial masses m 1 
and m 2 , then, according to eqn (1.3), the ratio of the accelerations 
of the bodies would be in the inverse ratio of their inertial masses. 
Newton’s third law states that action and reaction are equal and 
opposite. Hence, if the two bodies collided, then the instantaneous 
values of the forces acting on the two bodies should be equal and 
opposite. Let a body of inertial mass m 1 moving with velocity w x 
collide head on with a body of inertial mass m 2 , which is at rest 
before the collision. Let both the bodies m 1 and m 2 move in the same 
direction after the collision with velocities u[ and u 2 respectively. 
If f x and f 2 are the instantaneous values of the forces acting on m x 
and m 2 at any instant during the collision, then according to 
Newton’s third law 

= ^2 


Integrating over the time of the collision, one obtains 


that is 


Hence, 
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that is 

m x u x = m x u[ + m 2 «2 (1.4) 

This is the law of conservation of momentum. It is important to 
realize that it follows automatically from Newton’s laws of motion. 
From eqn (1.4) 

m, u o 

— = 7- —k (1.5) 

m 2 («q — zq) v > 

Hence, if the instantaneous values of the velocities just before and 
just after collision are determined, then the inertial masses can be 
compared. One method of performing the experiment would be to 
suspend the masses as two ballistic pendulums and let them collide. 
If one of the masses, say m 1} is the primary standard of mass, then 
the inertial mass of the other particle can be determined. This mass 
can then be used as a secondary standard of inertial mass. The 
theory of Newtonian mechanics has to be used before one can say 
how the inertial masses of two bodies can be compared. Having 
defined and prescribed how the inertia] masses of a body can be 
measured, then the force acting on a body can be measured by 
observing the acceleration the force produces on a body of known 
inertial mass. Eqn (1.3) can then be used to calculate the force. 
Thus the precise meanings of some quantities such as force and mass 
depend on the theory being used, and must be measured in practice 
as prescribed by that theory. If the experimental evidence forces 
one to replace a theory by a new theory, then these quantities must 
be redefined and measured as prescribed by the new theory, and 
these definitions and procedures of measuring may differ from those 
of the old theory. It will be found that this will have to be the case 
when one comes to replace Newtonian mechanics by relativistic 
mechanics. As new theories are developed the meaning of words, 
such as mass and force, keeps changing, and the meaning to be 
attached to any word must be interpreted within the context of the 
theory being used. 

Another property of matter associated with the term mass arises 
within the context of Newton’s law of universal gravitation [eqn 
(1.1)]. The gravitational mass of a body is a measure of that 
property of matter by virtue of which every particle of matter exerts 
a gravitational force of attraction on every other particle of matter. 
Inertial mass is associated with a completely different property of 
matter, namely the fact that a force, not necessarily of gravitational 
origin, must be exerted on a particle in order to accelerate it. In 
classical theory it is an extra hypothesis, which must be tested by 
experiment, to equate gravitational and inertial mass. For a body 
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HISTORICAL INTRODUCTION—NEWTONIAN MECHANICS 
of gravitational mass m G falling near the surface of the earth the 

mQ-M-Q . , _ 

force of gravitational attraction is equal to G , where M G is 

the gravitational mass of the earth and R is the radius of the earth. 
The acceleration produced by this force is equal to the ratio of the 
force to the inertial mass of the body on which it is acting. Hence, 


(gravitational mass) GM 0 
(acceleration) — (j ne rtial mass) R? 

If the gravitational mass of a body were not proportional to its 
inertial mass, the accelerations of different bodies in the earth s 
gravitational field would not always be the same. Experimentally 
it is found that this acceleration is the same for all bodies, so that the 
experimental results require the proportionality of gravitational and 
inertial mass. If the units are chosen appropriately, then the 
gravitational and inertial masses can be made numerically equal. 
The gravitational mass of a body can be determined by weighing, 
using an equal-arm balance. This is the most convenient way of 
determining masses experimentally. To use these values in eqn 
(1.3) involves the extra hypothesis of the equality of gravitational 
and inertial mass. 

There are many other hypotheses implied in the theory of 
Newtonian mechanics and which are not emphasized when the 
subject is taught for the first time. For example, it is assumed in 
eqn (1.3) that the mass of a body is independent of its velocity; it is 
assumed that all regions of space and all directions in space are 
equivalent; it is generally assumed that Euclidean geometry can 
be used to calculate the relationships between geometrical quantities, 
and it is assumed that rigid bodies exist. It is shown in Section 1.4. 
that in Newtonian mechanics it is assumed that an absolute time 
exists. These extra hypotheses are not given a priori, and experiment 
must decide whether they are correct or not. In fact many of these 
ideas have had to be modified within the context of the theories of 
special and general relativity. 

The advance of physics is one of successive approximation, giving 
rise to better models of nature at each stage as new theories are 
developed. When applying a theory one considers an isolated 
system. The solar system is an example of a very good approxima¬ 
tion to an isolated system, since the effects from outside the solar 
system, such as the gravitational attractions of distant stars, whilst 
they are always present, are so small that they can generally be 
neglected in practice. One generally starts with the simplest 
possible system, for example, with the sun and one planet. This 
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enables the orbit of the planet to be calculated reasonably satis¬ 
factorily. In order to improve the agreement between the theory 
and observations, the perturbations due to the gravitational attrac¬ 
tions of other planets are introduced. In this way extra variables 
are introduced into the system but the same theory is used. This 
procedure generally works satisfactorily, but sometimes a state is 
reached when the old theories cannot be adjusted in this way to 
account for the experimental results. For example, using Newton’s 
theory of gravitation, the perturbations in the orbit of the planet 
Mercury due to the gravitational attractions of the other planets 
were not adequate to account for the precession of the perihelion of 
the orbit of the planet Mercury. This discrepancy could only be 
resolved by replacing Newton’s theory of gravitation by a completely 
new theory, namely Einstein’s theory of gravitation which is based 
on the theory of general relativity and to which Newton’s theory of 
gravitation is only an approximation. Similarly, the theory of 
special relativity is not a series of amendments to Newtonian 
mechanics, but a complete replacement necessitating even a 
reinterpretation of mass, length and time. Since physics advances 
by a series of successive approximations, it is foolish to say at any 
stage that the process of successive approximation is over. Therefore, 
when one comes to replace Newtonian mechanics by relativistic 
mechanics, one cannot claim infallibility for the theory of special 
relativity, but must always admit that it in turn can be replaced at 
any time by a new theory, provided the new theory is in better 
agreement with the experimental results. In fact, special relativity 
has its limitations and in non-inertial frames must be replaced by 
the general theory of relativity. 

As physics has become more complex, so the basic postulates of 
the theories have become more complex, and the basic postulates 
cannot always be tested direcdy by experiment, as was done for 
example by Cavendish in the case of Newton’s law of gravitation. 
Modern theories are often suggested to interpret the experimental 
results and are not always based directly on ‘induction’ from 
experiments. The degree of acceptance of such a theory does not 
depend only on the agreement of the fundamental postulates with 
experiment, but rather on whether or not the predictions of the 
theory as a whole agree with the experimental facts. It is shown 
later that the axioms which Einstein chose as the starting point of 
the theory of special relativity were of this type, and initially the 
whole theory had to be tested a posteriori. 

It will be shown that when the velocities of the particles involved 
are very much smaller than the velocity of light, the differences 
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between Newtonian mechanics and relativistic mechanics are 
negligible. Thus, when v < c, as is generally the case in everyday 
life, Newtonian mechanics can be used even though it is known that 
the equations of the theory of special relativity give a better model 
of nature. It is an unnecessary elaboration to use the equations of 
the theory of special relativity in these circumstances. The laws of 
Newtonian mechanics are strictly deterministic; for given initial 
conditions the predictions are always the same. This works well 
in practice for bodies large on the atomic scale, but for atomic 
particles Newtonian mechanics has to be replaced by quantum 
mechanics. 

1.4. THE GALILEAN TRANSFORMATIONS 

Newton assumed that an absolute space existed. For example, he 
wrote ‘Absolute space, in its own nature and without regard to 
anything external, always remains similar and unmovable’. 

It will be assumed, for purposes of discussion, that absolute space 
exists and that Newton’s three laws of motion hold in absolute 
space, at least to a very good approximation. Consider a co-ordinate 
system fixed in absolute space, which will be denoted by Z. It is 
useful to visualize the co-ordinate system Z as having an array of 
imaginary rulers parallel to the x, y and z axes respectively, so that, 
in principle, the co-ordinates of an event can be read directly from 
the rulers. If Newton’s first law is valid in Z, then a particle P, not 
acted upon by any forces, should travel in a straight line relative to 
Z. Let the direction of the axes of Z be chosen such that the motion 
of P is in the xy plane. If the velocity of the particle P is u and it is 
at the origin at a time t = 0, then its position at a time t should be 
given by x = u x t,y — u v t , z = 0, and its path should make an angle 
tan -1 ( u v /u x ) with the x axis. 

Now consider another co-ordinate system, denoted Z', which is 
moving with uniform velocity v relative to absolute space (Z) along 
the common x axis. Let the origins coincide at a time t — t' — 0, 
and let the directions of the y' and z' axes of Z' coincide with they 
and z axes of Z at t = 0. The co-ordinate system 27 will be considered 
as having its own series of imaginary rulers so that the co-ordinates 
of an event can be measured relative to Z '. By an event is meant 
something which happens independently of any co-ordinate system, 
e.g. if the particle P collided with another particle both an observer 
at rest in Z and one at rest in Z' would agree that the collision took 
place, though they would attribute different space co-ordinates to 
the event. 
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Newton also assumed that absolute time existed. For example, 
he wrote: ‘Absolute, true and mathematical time, of itself, and 
by its own nature, flows uniformly on, without regard to anything 
external’. 

If there were an absolute time, then time intervals should be the 
same when measured in both E and E', and since the zero of time 
in both E and E' was chosen to be the instant when the origins 
coincided, then, for all times, t = t' where t and t' are the times of an 



Figure 1.1. The measurement of the x and x' co-ordinates of an event 


event measured relative to E and E' respectively. The concept of 
absolute time was a weakness of Newtonian mechanics, since, at 
the time it was introduced, it was not possible to cite any experi¬ 
mental evidence for or against it. In the present section the concept 
of absolute time will be accepted as an integral part of Newtonian 
mechanics, even though it was introduced a priori without any 
experimental evidence. 

In order to compare the x co-ordinates of an event in E and E', 
imagine a ruler laid out along the common x axis as shown in 
Figure 1.1 (the ruler can be moving relative to both E and E'). If 
time were absolute, observers at rest in E and E' respectively would 
always agree on the time an event happened, so that both would 
agree that 0 the origin of E coincided with a certain position X x on 
the ruler, when O' the origin of E' coincided with a position X 2 on 
the ruler, and the x and x' co-ordinates of the event coincided with 
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another position X 3 , as illustrated in Figure 1.1. Hence, if time 
were absolute, the observers at rest in E and E' should both agree 
that 

* 1*3 = * 1*2 + * 2*3 

If it is assumed that x and x' are measured in the same units, then 

x= 00' + x' 

If v is the relative velocity of the co-ordinate frames, then 00 is 
equal to vt. Hence 

x' — x — vt 
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Figure 1.2. The measurement of they andy' co-ordinates of an event 


If the ruler were placed in the y direction as shown in Figure 1.2, 
then both observers should agree that their x and *' axes coincided 
with a position Y 1 on the ruler at the same time as the y and y 
co-ordinates of the event coincided with the position Y 2 , so that both 
observers would record the same numerical value for the y co¬ 
ordinates of the event, that is 


Similarly, 


y =/ 

z = z' 


Collecting the transformations, one has: 


x r 

= x — vt or x = x' + vt 

(1.6) 

y 

=y 

(1.7) 

z r 

= z 

(1.8) 

t' 

= t 

(1.9) 
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These transformations are generally called the Galilean trans¬ 
formations. 

In carrying out a measurement of the length of a moving object, 
the observers at rest in £ and £' would observe the positions of the 
ends of the object on a ruler measuring both ends at the same time. 
If time were absolute they would agree on simultaneity. Let one 
end of the moving object be at a point x v y v z 1 at a time i and let the 
other end of the moving object be at the point x 2 , y 2 , z 2 in £ at the 
same time t. If the co-ordinates of these events are x[, y[, z{ and 
4, 4> z 2 respectively in £', according to the Galilean transfor¬ 
mations, if the events are measured at the same time in both £ 
and £', 

(*a - *i) 2 + (J’a - Jh) 2 + ( z 2 - z i) 2 

= (4 + vt — Xi — vt) 2 + (y' 2 —y[) 2 + (z’ z — 4) 2 

= (4 — 4) 2 + (4 — 4) 2 + (4 — 4) 2 

Thus the length of a moving object is absolute in Newtonian 
mechanics, that is, it has the same numerical value in £ and £'. 
It can be seen that this result follows from the postulate of absolute 
time. 

Class Question— How would you compare the times of two events, one on 
the earth and one on the moon? How would you synchronize a clock 
on the moon? (Comment: If you use radio signals, what assumption do 
you make about the speed of light in different directions of empty space ? 
If you make any assumption at all about the speed of light in different 
directions, you are using a theory to help to define how to compare the 
times of spatially separated events.) 

Now consider the particle P which is moving with uniform velocity 
u in £. At a time t it has co-ordinates given by 

x = u x t\ y = u v t; z — 0 

Using eqns (1.6), (1.7) and (1.8), the co-ordinates of P measured in 
£' at a time t = t' are 

x' — u x t — vt; y = u y t; z' — 0 

The particle P, which is at the origin of £' at t' = 0 moves in the 
x'y' plane in £', such that 

x' x — vt xjt — v u x — v 

y'~ y ~ y/t ~ 

Since u x , u y and v are all constants, x’jy’ is a constant, and the 
particle P moves in a straight line in £', making an angle of 
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tan -1 [_(u x — v ) Iu y ] with the y' axis. Thus, Newton’s first law applies 
in Z', if it applies in Z. 

So far only uniform motion has been considered. It will now 
be assumed that the particle P has an acceleration a relative to Z. 
Differentiating eqn (1.6) with respect to time, 


dx' dx 
dt dt 


or a' = u x 


v 


( 1 . 10 ) 


Similarly, differentiating eqns (1.7) and (1.8), 

Up Z= Uy (l.ll) 

and «; = «, (i.i2) 

These are the velocity transformations. They show that the value 
of the velocity of a particle depends on the co-ordinate system chosen, 
that is, on the standard of rest chosen. 

Differentiating eqns (1.10), (1.11) and (1.12) with respect to 
time, since v is a constant, one obtains 


du' x 

dt 


da* 

dt 


du'y 


du„ 


that is 


dt dt 


a == a 


da' 

dt 


da z 

dt 


(1.13) 


In eqn (1.13) a' is the acceleration of the particle relative to Z' and 
a is its acceleration relative to Z. Provided v, the relative velocity 
of Z and Z', is constant, the accelerations are the same in Z and Z'. 
It is assumed in Newtonian mechanics that the mass of a particle 
is independent of its velocity, that is, mass is an invariant. Multi¬ 
plying both sides of eqn (1.13) by m, 

ma! = ma (1-14) 

Since the force acting on a particle can be defined as the product of 
the mass of the particle and the acceleration produced by the force, 
ma! is equal to the force producing the acceleration of P measured 
in Z', whilst ma is the value of the force acting on the particle 
measured in Z. Hence f f' (115) 


Thus the force on the particle, measured in terms of the accelera¬ 
tion it produces, should have the same value in both Z and Z'. 

Force is not always measured in terms of the acceleration it 
produces. For example, if a body is suspended by an elastic string, 
the force of gravitational attraction is opposed by the elastic restor¬ 
ing force. If the body is at rest, it is not convenient to measure the 
elastic force in terms of an acceleration. The way the concept of 
force is generally used in dynamics can be illustrated by the 
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following example. If one wanted to calculate the acceleration of 
a body falling under gravity in the vicinity of the moon, one would 
assume that the force of gravitational attraction between the 
particle and the moon is given by eqn (1.1). Then using this value 
of force in the equation, force equals mass times acceleration, the 
acceleration of the body could be calculated. Thus, in Newtonian 
mechanics, it is sometimes assumed that the force acting on a 
particle is known independently of the acceleration of the particle. 
Using Newton’s second law as a definition of force, it was concluded 
that if mass is absolute, force is absolute also. In Newtonian 
mechanics the equation f =f is extended to all types of forces; 
for example, it is assumed that the force due to gravitational 
attraction, given by eqn (1.1), is an invariant. Similarly, it is 
assumed that elastic forces are invariant. It has been shown that 
acceleration is an invariant. Hence, if mass and force are invariants, 
the equation force equals mass times acceleration can be used in any 
inertial frame, and under these conditions Newton’s second law is 
invariant under a Galilean transformation. 

At this stage the reader will have realized that there are a large 
number of assumptions such as absolute time, absolute mass, ab¬ 
solute force in Newtonian mechanics. However, within the limita¬ 
tion that all velocities are very much smaller than the velocity of 
light, Newtonian mechanics works well in practice. As an example 
consider a body falling olf the top of the mast of a ship moving with 
uniform velocity. It is assumed in Newtonian mechanics that the 
force on the body is given by G(mM/R 2 ) = mg relative to both the 
ship and the seashore. Newton’s second law can be used to calculate 
the acceleration, and hence the motion relative to either the ship or 
the seashore. The co-ordinates and time at any point on the path 
of the particle relative to the ship and the seashore are connected by 
the Galilean transformations. 

If two particles collide or act on each other, then, if Newton’s 
third law is valid in S (absolute space), action and reaction would 
be equal and opposite in S. This means that the forces of the 
colliding particles on each other would always be equal and 
opposite when measured in S. According to eqn (1.15) these 
forces would have precisely the same numerical values when 
measured in S' as they have in S. If Newton’s third law is valid in 
S, then it is also valid in S'. 

It has been shown that if Newton’s three laws of motion hold in 
absolute space (S), then they should hold in any co-ordinate system 
S' moving with uniform velocity relative to absolute space (S), if 
it is assumed that time, mass and force are absolute. Any one of 
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these co-ordinate systems could be used for recording mechanical 
phenomena and the results obtained could be interpreted in terms 
of Newton’s laws of motion. If the positions and velocities of the 
particles in a physical system are given relative to S', from the 
forces acting on the particles, which are the same as in S, one can 
calculate the accelerations of the particles and calculate the sub¬ 
sequent motions of the particles relative to S' without any reference 
to absolute space. Furthermore, when Newton’s laws are applied 
in one of the co-ordinate systems moving with uniform velocity 
relative to absolute space, the laws contain no term containing the 
velocity of the co-ordinate system relative to absolute space, but the 
laws take the same mathematical form in any one of the co-ordinate 
systems moving with uniform velocity relative to absolute space. 
From measurements carried out in any one of these moving co¬ 
ordinate systems, the existence of absolute space cannot be proved. 
Absolute space therefore loses its significance for linear motion. 
Thus, Newton’s laws satisfy the principle of relativity when the 
co-ordinates and time are changed according to the Galilean 
transformations. Some absolute quantities still remain in Newtonian 
mechanics, for example, absolute time and absolute mass. 

The points discussed in this section can be illustrated by consider¬ 
ing the example of a ship going out to sea with uniform velocity on 
a calm day. It would be quite possible under these conditions to 
mark out a tennis court and to play a game of tennis on the deck 
of the ship, and, if there were no accelerations whatsoever, the 
players would not realize that the ship was moving without looking 
at something external to the ship. No experiment performed inside 
the ship, without reference to anything external to the ship, would 
give the velocity of the ship relative to the shore, since the laws of 
mechanics, determined from the experiments carried out inside the 
ship would not include a term containing the velocity of the ship 
relative to the earth. It would be perfectly satisfactory to interpret 
the game of tennis from the ship. An observer sitting on the seashore 
could also watch the game. Within the context of Newtonian 
mechanics the observer on the shore would agree with an observer 
on the ship that Newton’s laws of motion were valid, at least to a 
good approximation. He would also agree on the dimensions of the 
court, the time at each stage of the game and on the score, but the 
two descriptions would not be completely identical. The observer 
on the shore would say [following from eqns (1.10), (1.11) and (1.12)] 
that the ball left one player’s racket faster than the other’s, if the 
ball came off both rackets at the same speed relative to the ship. 
The reader will probably accept it as quite natural that the speed 
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of the ball will be different, when the standard of rest is changed. 
The relations between the observer on the ship and on the shore are 
perfectly reciprocal. If an observer on the ship watched a game of 
tennis on the beach the role of the observers would be interchanged. 

Newton realized that the concept of absolute space was not 
necessary for linear motion, e.g. he wrote: 

And thus we use, in common affairs, instead of absolute places and 
motions, relative ones; and that without any inconvenience. But in 
physical disquisitions, we should abstract from the senses. For it may be 
that there is no body really at rest, to which the places and motions of 
others can be referred. 

Newton felt that absolute rotations could be observed, e.g. he wrote: 

The effects by which absolute and relative motions are distinguished 
from one another, are centrifugal forces, or those forces in circular 
motion which produce a tendency of recession from the axis. 

Co-ordinate systems accelerating relative to an inertial frame will 
now be discussed. 

1.5. NEWTON’S LAWS OF MOTION IN A CO-ORDINATE 
SYSTEM ACCELERATING OR ROTATING RELATIVE 
TO AN INERTIAL FRAME 

It was pointed out in Section 1.4 that one could play a game of 
tennis on a ship provided the ship moved with uniform velocity. 
However, once the ship encountered rough seas and started to 
undergo sudden accelerations, one would soon realize that the laws 
of motion are affected by the acceleration of the reference system 
relative to the earth. It will now be shown how Newton’s laws of 
motion must be modified, if one chooses as one’s standard of rest 
a co-ordinate system accelerating or rotating relative to an inertial 
frame. It will again be assumed that there is an inertial frame 2 
in which Newton’s laws are valid, at least to a good approximation. 
If there are no forces acting on a particle P then it will move in a 
straight line in 2. Consider a co-ordinate system (denoted A) which 
has a uniform linear acceleration a r relative to 2 directed along the 
common x axis. Let the origins coincide at t = t' = 0 and let the 
co-ordinate systems be at rest relative to each other at this instant. 
It will be assumed that time is absolute, that is, that t = t'. Choose 
the directions of the y and z axes of 2 and the y' and z axes of A 
such that the particle P moves in the xy plane in 2 with a uniform 
velocity u having components u x , u y , u z = 0. If the particle P were 
at the origin of 2 at t = 0 then after a time t, it should be at the 
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point * = uj, y = u y t, z = 0 in E. Its path in E is given by the 

u v 

equation y = — x. 

U X 

At a time t the * co-ordinate of the origin of A relative to E is 
equal to \a T t 2 . If x' is the co-ordinate of the particle P relative to A 
at the time t' = t, then 

x = x' + 

that is 

x' = x — ia r t 2 = uj — £a r t 2 (1.16) 

Similarly, 

y =y = u v t (1-17) 

z' = z = u z t (= 0 in the present case) (1.18) 


Substituting t =y'/u v from eqn (1.17) into eqn (1.16), 


7 / " 2 u% r 


Relative to the co-ordinate system A the particle P moves in a 
parabola and not in a straight line even though no real or impressed 
forces act on P. Hence, Newton’s first law does not hold in the co¬ 
ordinate system A, and the particle P appears to have an accelera¬ 
tion —a r relative to A; thus A is not an inertial frame. If one 
wanted to apply Newton’s first law in the co-ordinate system A, 
then one would have to introduce a fictitious force — ma r acting on the 
particle P of mass m in order to interpret its apparent acceleration. 
This fictitious force is called an inertial force; it does not produce 
an acceleration relative to the fixed stars. As an example of the use 
of a fictitious inertial force, consider a passenger on a moving train. 
If the passenger dropped a ball from the ceiling of the carriage, then 
the ball would move in a parabola relative to an observer at rest on 
the earth. If the train were moving with uniform velocity, the ball 
would appear to fall vertically downwards relative to the passenger 
on the train. If the train were accelerating, the ball would not 
move in a straight line relative to the passenger on the train, but 
would appear to be deflected from the vertical in a direction 
opposite to the direction of the acceleration of the train. If the 
passenger on the train wanted to interpret this motion in terms of 
Newton’s laws of motion, he would have to introduce the fictitious 
inertial force -ma, in order to account for the deflection of the ball 
from the vertical. This fictitious force produces no acceleration 
relative to the fixed stars. Another example of linear acceleration 
is a mass resting on the floor of a lift which is itself falling freely 
under gravity. 
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A different type of accelerated motion is shown in Figure 1.3(a). 
A particle P, of mass m, is attached by a string to the point 0, and 
is rotating on a smooth horizontal table with uniform angular 
velocity co about an axis through 0, as shown in Figure 1.3(a). 
Relative to the earth, which will be considered as an approximate in¬ 
ertial Frame, the particle P has an acceleration ra> 2 directed towards 
O. In order to give rise to this acceleration there must be a force 
equal to mrco 2 acting on the particle P. This centripetal force is due 
to the pull of the string on the mass m. According to Newton’s 


y 


X 


(Inertial frame) 


A 


Co-ordinate system rotating with\ 
angular velocity co relative to X ) 



Centripetal force 
= mrco 2 
Spring balance to 
measure the tension 
in the string 


Fictitious centrifugal 
force = mrco 2 


Centripetal 
force = mrco 2 . 



P is at rest in A 


Spring balance 
to record tension 


(a) 


(b) 


Particle P rotates with uniform angular velocity to, on a smooth horizontal 
table; {b) in the co-ordinate system A which rotates with P, a fictitious centrifugal force 
mrco must be introduced if one wants to apply Newton’s laws of motion in the rotating 

co-ordinate system h 


third law there is an equal and opposite force on the string. This 
force arises from the fact that the particle P is tending to carry on in 
a straight line relative to the earth, that is, it is tending to fly off at 
a tangent to the circular motion. The string keeps the particle P 
moving in a circle. The tension in the string can be measured by 
a spring balance. Now consider a co-ordinate system, denoted A, 
rotating with the particle P, such that, relative to A, P is at rest 
as shown in Figure 1.3(b). There is still a centripetal force equal to 
mrw acting on P and directed towards 0 due to the tension in the 
string; this force will still be measured by the spring balance. If 
one wanted to apply Newton’s laws of motion in the rotating co¬ 
ordinate system one would have to introduce a fictitious inertial 
force ^ mrm 2 acting outwards; there would then be no resultant 
force on P, and P would remain at rest in the rotating co-ordinate 
system. This fictitious outward force is sometimes called the centri¬ 
fugal force. It produces no acceleration relative to the fixed stars. 
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vacuum, where there was nothing external or sensible with which the 

globes could be compared. . . . 

According to Einstein’s theory of general relativity, the laws of 
motion (in a gravitational field) can be expressed in a form which is 
the same in all reference frames, whether they are accelerating or 
not. Apart from Chapter 9, this book is concerned almost ex¬ 
clusively with the theory of special relativity, which deals only with 
inertial frames of reference. 

1.6. INERTIAL REFERENCE FRAMES 

The question arises, how can one select a co-ordinate system in 
which Newton’s laws of motion are valid ? If in a reference frame a 
particle under the influence of no forces (i.e. a particle far away 
from any other particles capable of exerting forces) travels in a 
straight line with constant speed, then Newton’s first law is valid 
in this reference frame. This reference frame is then considered a 
suitable reference frame for the application of Newton’s laws of 
motion. Such a reference frame is called an inertial reference frame, 
or sometimes a Galilean reference frame. Now it is not always 
possible to remove all other bodies, so the concept of inertial 
frames must be examined a little more closely, especially within 
the context of the theory of special relativity. It is generally only 
necessary to use the theory of special relativity when the velocities 
of the particles are comparable with the velocity of light. The only 
particles, available in the laboratory, which have such speeds are 
atomic particles. For atomic particles the electric forces are very 
much greater than the gravitational force of attraction; for example, 
the ratio of the electric to the gravitational force between two 
a-particles is ~10 35 . Hence, for atomic particles, if all neighbouring 
bodies are electrically neutral and produce no magnetic fields, the 
effects of neighbouring bodies are negligibly small. It is possible to 
carry out experiments on the motions of ions in the absence of 
electric and magnetic fields and it is found experimentally that the 
ions do travel in straight lines (subject of course to the limitations 
due to the Uncertainty Principle and to the small gravitational 
attraction of the earth which can generally be neglected). Hence, 
for most atomic and nuclear processes studied in the laboratory, 
the co-ordinate system in which the earth is at rest, that is the 
laboratory system, is a sufficiently good approximation to an ideal 
inertial reference frame. For low velocity mechanical phenomena 
the reference system in which the earth is at rest is a satisfactory 
approximation to an inertial frame, provided the force of gravity 
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is considered as an impressed force. In this system, generally called 
the laboratory system, Newton’s laws can be used to make accurate 
predictions. However, the earth is rotating about its axis and about 
the sun. In very accurate works these rotations have to be taken 
into account, e.g. with Foucault’s pendulum experiment. A better 
approximation to an inertial frame is the reference frame at rest 
relative to the ‘fixed’ stars. These stars may have some irregular 
motions, but this reference frame, or one moving with uniform 
velocity relative to it, is a sufficiently good approximation to an 
ideal inertial frame for all terrestrial phenomena. For cosmological 
problems the choice of one single inertial frame to describe the 
whole universe is not appropriate, and the concept of frames of 
reference must be refined and extended within the context of the 
general theory of relativity. 

PROBLEMS 

Problem 1.1 —Give an account of the standards of mass, length and time. 
How would you set up similar standards in a rocket moving with uniform 
velocity relative to the earth? 

Problem 1.2— Distinguish between inertial mass and gravitational mass 
within the context of Newtonian mechanics. Discuss the experimental 
evidence for the proportionality of the inertial and gravitational mass of a 
body. (Include a discussion of Efjtvbs’ experiment.) 

Problem 1.3 —If you were locked in a cave and had no contact with the 
outside world, what experiments, if any, could you perform to show that 
(a) the earth has a linear velocity in its orbit around the sun, and (b) the 
earth is rotating. Could you determine your geographic latitude? [Hint: 
For part (b), use either Foucault’s pendulum or an accelerometer of the 
type described by Struve, Elementary Astronomy, p. 51.] 

Problem 1.4 —Show that Newton’s laws of motion obey the principle of 
relativity when the co-ordinates and time are transformed according to the 
Galilean transformations. State clearly all the assumptions you make. 

Problem l.S —A stone is dropped from rest from the mast of a ship moving 
with a velocity of 15 m/sec. Choose the origins of E (the laboratory frame 
and 2" (the co-ordinate system in which the ship is at rest) such that the 
origins coincide with the stone at the instant t = 0 when the stone is dropped. 
Show that the path of the stone relative to Z' (the ship) is a straight line 
given by x' = 0; y' = —gt 2 . Show that if the ship moves along the x axis, 
then relative to the laboratory the stone moves in a parabola given by 
x = 15 t,y = —gt 2 . Show how these equations are related by the Galilean 
transformations. 

Problem l.S —A particle is suspended from the roof of a carriage of a train 
moving with uniform velocity. If the particle moves in a circular motion 
relative to the train (e.g. a conical pendulum), use the Galilean transforma¬ 
tions to show that its path relative to the earth is a cycloid. 
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Problem 1.7— Discuss what ‘fictitious’ inertial forces must be introduced 
into Newtonian mechanics, if we choose as our standard of rest a co-ordinate 
system rotating relative to the fixed stars. 

Problem 1.8 —A particle is dropped vertically towards the earth. Show 
that after a time t it is deviated to the east by an amount ( cogt 3 /3 ) cos A where 
co is the angular velocity of the earth, g the acceleration due to gravity and A 
is the geographic latitude. (Reference: Joos, Theoretical Physics, 1st edition, 
p. 223.) 

Problem 1.9— Show that the plane of vibration of a Foucault pendulum 
rotates with a velocity co sin A, where A is the latitude and co is the angular 
velocity of the earth. (Reference: Struve, Elementary Astronomy, p. 48.) 

Problem 1.10— A particle is at rest relative to the earth. Relative to an 
observer on a rotating roundabout the particle appears to be rotating with 
angular velocity co. Relative to the roundabout the fictitious centrifugal 
force mrco 2 acts outwards. How is the particle kept moving in a circle 
relative to the rotating frame of reference? [Hint: Remember the Coriolis 
force.] 

Problem 1.11 —Show that the value of the acceleration due to gravity 
measured by a simple pendulum at sea level varies by about 0-3 per cent 
between the poles and the equator due to the rotation of the earth, [Hint. 
Show that at the equator, when the centrifugal force is taken into account, 
g = 2 = g 0 (l - co 2 Rlg 0 ), where g 0 is the acceleration due to gravity at the poles, 
co = 7-29 x 10~ B rad/sec is the angular velocity and R = 6,367 km is the 
radius of the earth.] 
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HISTORICAL INTRODUCTION 
II—THE RISE OF THE THEORY 
OF SPECIAL RELATIVITY 

2.1. INTRODUCTION 

A brief review of the historical development of the theory of special 
relativity is given in this chapter. A fuller, more balanced account 
is given in the author’s other text, An Introduction to the Theory of 
Relativity, (Butterworths, London, 1964), where full references to 
other works are given. In order to put the theory of special relativity 
into historical perspective, it is necessary to mention the theories of 
the ether developed in the nineteenth century. Details of experi¬ 
ments are relegated to Appendix 1. The reader should realize at 
the outset, that the mechanical theories of the ether turned out to 
be one of the biggest red herrings in the history of science. The 
reader should treat this chapter as a short incursion into the history 
of science, interpreted with hindsight and a modern bias. The 
reader should gloss quickly over this Chapter, and proceed in 
Chapter 3 to cultivate the relativistic point of view. The important 
fact for the reader to realize is that the theory of special relativity 
evolved from classical optics and electromagnetism. In the theory 
of special relativity, what Einstein 1 did was to extend the principle 
of relativity to the laws of classical optics and electromagnetism. 
This necessitated abandoning the Galilean transformations and 
re-interpreting the classical concepts of space and time. With just 
this amount of background, some readers may prefer to proceed 
directly to Chapter 3, where the theory of special relativity is 
developed from two main axioms, namely, the principle of relativity 
and the principle of the constancy of the speed of light. The whole 
theory can be checked a posteriori by showing that the predictions of 
the theory agree with experiment. Alternatively, one can now ap¬ 
proach special relativity via high-speed mechanics (cf. Appendix 4). 

2.2. THE LUMINIFEROUS ETHER 

After the initial success of Newtonian mechanics and Newton’s 
theory of universal gravitation, it seemed plausible to try and 
explain all natural phenomena in terms of mechanics, particularly 
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as the concepts of mechanics were familiar to people in their daily 
lives. This aim at a mechanical interpretation of all natural 
phenomena started with Descartes (born 1596) and continued up 
to the end of the nineteenth century. For example, towards the 
end of this period Lord Kelvin wrote: ‘I am never content until I 
have constructed a mechanical model of the object I am studying. 
If I succeed in making one, I understand; otherwise I do not’. This 
approach proved very successful in the case of sound. Sound was 
interpreted as the propagation of longitudinal waves in a material 
medium. Sound will not travel through a vacuum, a material 
medium being essential for its transmission. 

In the same way attempts were made to interpret light in terms 
of mechanical models, and two theories arose. One was the cor¬ 
puscular theory in which light was pictured as a stream of little 
corpuscles. It was assumed that these small particles obeyed the 
laws of mechanics and produced the sensation of light when they 
struck the eye. The other theory was the wave theory. It was 
developed, initially, by Hooke, and was formulated in detail by 
Huygens. On this theory, light was considered as a form of wave 
motion propagated through space. At the time of Newton, and 
for some time afterwards it was assumed that the waves constituting 
light were longitudinal waves. It is not possible to account for the 
polarization of light in terms of longitudinal waves. At the time, 
Newton was able to account for polarization more satisfactorily in 
terms of the corpuscular theory. He felt that the corpuscular theory 
was also able to account for the rectilinear propagation of light 
better than the wave theory, since on the wave theory light should 
be diffracted around corners in the same way as sound is; the 
wavelength of light had not been determined at that time. The 
corpuscular theory remained pre-eminent until the beginning of 
the nineteenth century, when Young investigated the interference of 
two beams of light. Young and Fresnel were able to account for the 
newly observed phenomena of interference and diffraction on the 
wave theory, and from that time onwards the wave theory of light 
came to be accepted. Since sound will not travel through a vacuum 
and must have a material medium to transmit it, it was reasonable 
to assume that, if light was a form of wave motion, then there had to 
be some medium present in a vacuum as well as inside material 
bodies, which was able to transmit the vibrations constituting light. 
This medium was called the ether. 

When considering the ether theories of light, it is helpful to use 
the theory of the transmission of sound as an analogy. In this 
analogy, the air or material body which transmits the sound waves 
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corresponds to the light wave. Now the velocity of sound in air is 
independent of the velocity of the source of sound, but it does 
depend on the strength and direction of the wind. If V is the speed 
of sound in still air and w is the speed of the wind, according to the 
Galilean velocity transformations, the speed of sound varies from 
V + w to V — w when the sound travels in the direction of the wind 
and in the direction opposite to it respectively. By analogy with 
sound, it was assumed in the ether theories that the speed of light 
was independent of the speed of the source of light. If the earth 
were moving relative to the hypothetical light transmitting medium, 
that is the ether, there would be an ether ‘wind’, relative to the 
earth. By analogy with sound, it was believed that this ether ‘wind’ 
should affect the velocity of light, even in empty space. If c is the 
speed of light when the ether is at rest and w is the speed of the ether 
‘wind’ relative to the earth, according to the Galilean velocity 
transformations, eqns (1.10), (1.11) and (1.12), the speed of light 
should vary between c + w and c — w when c is parallel to w and 
c is antiparallel to w respectively. Thus, in principle, by measuring 
the speed of light in different directions in space, one should have 
been able to determine the motion of the earth relative to the 
absolute reference system in which the hypothetical ether was 
assumed to be at rest. 

Class Question —In your course on physical optics, what type of wave 
motion in what type of medium did the wave equation you used represent ? 
(Comment: At school level the answer to the above questions would 
probably be that the reader had been taught that light was a form of 
wave motion, but the question of what type of motion in what type of 
medium was probably never considered. By analogy with sound and water 
waves, such a reader would probably accept that the speed of light should 
be independent of the speed of the source. More advanced readers will 
probably think of light as an electromagnetic wave. In this case also the 
reader would accept that the speed of electromagnetic waves is independent 
of the speed of the source, cf. Appendix 2.) 

Many readers may prefer not to think in terms of a mechanical 
light transmitting ether. Such readers can identify the hypothetical 
absolute system simply as a co-ordinate system in which the speed 
of light in empty space is the same in all directions, as shown in 
Figure 2.1{a). According to the Galilean velocity transformations, 
in any other reference frame moving with uniform velocity relative 
to this absolute system, the speed of light in empty space should be 
different in different directions, as shown in Figure 2.1(b). Due to 
its rotation about its axis and about the sun, it is unlikely that the 
earth is always at rest relative to such an absolute system, if such a 
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system exists. Thus by measuring the speed of light in different 
directions, one should have been able to determine the motion 
of the earth relative to the hypothetical absolute system. The most 
famous of these optical experiments was the Michelson-Morley 
experiment (1887). A full account of this experiment is given in 


2 (Absolute system) 


y 


(Speed of light is the 
same in all directions) 



c 


c 


c 



c 




(Moving system) 

(—*v Relative to the 
absolute system) 




C + V I c-v 



(a) (b) 

Figure 2.1. It is assumed that there is an absolute system E, in which the speed of 
light in empty space is the same in all directions, as shown in Figure 2.1(a). IJ it is 
assumed that the Galilean velocity transformations are correct, the speed of light in 
empty space should be different in different directions in the moving system E', shown in 
Figure 2.1(b). For example, for light parallel to the +x axis in E, u x — +c. 
Hence, from eqn (1.10), a' = u x — v = c — o. For light in the negative ex 
direction in E, u x = —c. From eqn (1.10), in E we have a' = u x — v = 
— c — V = — (c + v) as shown in Figure 2.1(b). Experiments failed to detect 
any variation in the speed of light in different directions relative to the earth. In the 
theory of special relativity, it is assumed that the speed of light in empty space is 
the same in all directions relative to E' as well as E. This means abandoning the 
Galilean transformations. This postulate is reasonable, if the laws of optics obey 
the principle of relativity, since there is then nothing to differentiate E'from E, if the 
speed of light is independent of the speed of the source, and if space is isotropic. 
Thus the speed of light in empty space should just as likely be the same in all 
directions in E' as in E. Predictions based on the assumption that the speed of light 
in empty space is the same in all directions in both E and E' have proved to be in 
accord with experiment 


Appendix 1. All these optical experiments failed to measure any 
variation in the speed of light in different directions of empty space, 
and failed to identify any absolute system for the laws of optics. 
Various ingenious interpretations of the null result of the Michelson- 
Morley experiment were suggested at the time. They are reviewed 
in Appendix 1. The interpretation which has come to be accepted, 
following the development of the theory of special relativity, is that 
the speed of light in empty space is the same in all directions in all 
inertial reference frames. (This assumes that the speed of light is 
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independent of the speed of the source.) For example, according 
to this interpretation the speed of light should be the same in all 
directions in empty space in both Figures 2.1(a) and 2.1(b). This 
assumption means that the Galilean transformations must be 
abandoned, and it leads to the re-interpretation of the classical 
ideas of space and time developed in Chapter 3. The failure to 
determine any absolute reference frame for the laws of optics was 
interpreted by Einstein 1 , by postulating that the laws of optics 
obeyed the principle of relativity. For example, according to 
Einstein, if one were in a ship moving with uniform velocity, 
relative to the earth, and if one carried out optical experiments, 
one should develop the same laws of optics on the basis of these 
experiments, as if the experiments were carried out in a laboratory 
at rest relative to the earth. It should be impossible to determine 
the speed of the ship by means of optical experiments, without 
looking at something external to the ship. If the speed of light in 
empty space is independent of the speed of the source of light, and 
provided space is isotropic (that is, all directions are equivalent) 
and if there is no absolute system for the laws of optics, the speed of 
light in empty space should be just as likely to be in the same in all 
directions relative to the ship as relative to the earth. Furthermore, 
provided the same units of length and time are used, the numerical 
value of the speed of light in empty space should be the same 
relative to both the ship and the earth. If the speed of light in 
empty space relative to the ship depended on the speed of the ship 
relative to the earth, one could carry out experiments on the 
ship to see how the speed of light depended on the speed of the ship. 
Thereafter, by measuring the value of the speed of light relative 
to a moving ship, one could determine the speed of the ship relative 
to the earth, without looking at anything external to the ship. 
This is contrary to the assumption that the laws of optics obey the 
principle of relativity. Hence the speed of light in empty space 
should have the same numerical value relative to the earth and 
the ship. This is the principle of the constancy of the speed of light. 
This is one of the axioms of the theory of special relativity. Pre¬ 
dictions based on this axiom have proved to be in agreement with 
experiments. 

At the beginning of the twentieth century, when the theory of 
special relativity was developed, the wave theory of light was 
accepted. The old corpuscular theory, in which it was assumed that 
light consisted of corpuscles, which obeyed the laws of Newtonian 
Mechanics, had been abandoned for some time. The quantum 
theory of light, in which it is assumed that light consists of photons 
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(or light quanta) of energy hv, where h is Planck’s constant and v 
is the frequency of the light, was developed at the same time as 
the theory of special relativity. Actually, Einstein interpreted the 
photoelectric effect in terms of photons (light quanta) in 1905, 
the same year as he introduced the theory of special relativity. The 
resolution of the dual nature of light (wave theory and photon 
model) had to await the development of quantum mechanics in the 
1920’s. The development of the photon model of light did not 
influence the historical development of the theory of special rela¬ 
tivity to any large extent. The properties of photons were shown to 
be consistent with the theory of special relativity. There is now 
direct experimental evidence to show that the speeds of photons, 
such as y-rays, are independent of the speeds of their sources, (cf. 
Sections 3.3 and 5.8.6). 

2.3. MECHANICAL THEORIES OF THE ETHER 

In the nineteenth century, the transmission of electric and magnetic 
forces through empty space was also interpreted in terms of a 
hypothetical ether. Following the work of Faraday and Maxwell, 
it was assumed that electric and magnetic field lines gave rise to 
tensions in, and pressures on, elements of the hypothetical ether 
which transmitted, by contiguous action the forces between electric 
charges. The second half of the nineteenth century saw the 
synthesis of classical electromagnetism and optics in Maxwell’s 
equations. Light was interpreted as an electromagnetic wave 
transmitted by the ether. The existence of electromagnetic waves 
was confirmed experimentally by Hertz. At that time, attempts 
were made to interpret all electric, magnetic and optical phenomena 
in terms of mechanical properties attributed to a single ether. In 
1867, Lord Kelvin suggested that the atoms of matter were vortices 
in the ether. In order to be so comprehensive and all embracing 
in their coverage, the mechanical models of the ether became more 
and more complicated. To quote Born 2 (1924) 

If we were to accept them literally, the ether would be a monstrous 
mechanism of invisible toothed wheels, gyroscopes and gears intergripping 
in the most complicated fashion, and of all this confused mess nothing 
would be observable but a few relatively simple forces which would 
present themselves as an electromagnetic field. 

Towards the end of the nineteenth century the view was beginning 
to arise that one should merely accept that the laws of electro¬ 
magnetism describe the electromagnetic forces between moving 
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charges, and one should not try to interpret the electromagnetic 
forces themselves in terms of a mechanical ether, whose properties 
could not be measured. The view that prevailed after the rise of 
the theory of special relativity may be summarized by the following 
quotation from Born 2 . 

Light or electromagnetic forces are never observable except in connection 
with bodies. Empty space free of all matter is no object of observation at 
all. All that we can ascertain is that an action starts out from one material 
body and arrives at another material body some time later. What 
occurs in the interval is purely hypothetical, or, more precisely expressed, 
arbitrary. This signifies that theorists may use their own judgement in 
equipping a vacuum with phase quantities (denoting state), fields, or 
similar things, with the one restriction that these quantities serve to 
bring changes observed with respect to material things into clear and 
concise relationship. 

This view is a new step in the direction of higher abstraction and in 
releasing us from common ideas that are apparently necessary components 
of our world of thought. At the same time, however, it is an approach 
to the ideal of allowing only that to be valid as constructive elements of 
the physical world which is directly given by experience, all superfluous 
pictures and analogies which originate from a state of more primitive 
and more unrefined experience being eliminated. 

From now onwards ether as a substance vanishes from theory. In its 
place we have the abstract ‘electromagnetic field’ as a mere mathematical 
device for conveniently describing processes in matter and their regular 
relationship. 

The equations of electromagnetism give the electric and magnetic 
fields due to a system of moving charges at a point in space. From 
these fields the force that would act on a test charge placed at that 
point in empty space can be calculated using the Lorentz force. 
It is the effects of the forces on charges that are observed experi¬ 
mentally. It must be pointed out that classical electromagnetism 
is correct only in so far as all effects, which arise from the finite 
value of Planck’s constant h, may be neglected. Classical electro¬ 
magnetism has to be extended to incorporate the effects of quantiza¬ 
tion. 

Nowadays, instead of interpreting electromagnetic forces in 
terms of mechanical models, we try to interpret the mechanical 
properties of solids and fluids in terms of atomic theory and quan¬ 
tum mechanics using electromagnetic forces. 

For the benefit of readers familiar with quantum mechanics a 
very brief review will now be given of how the wave properties of 
light are nowadays associated with individual photons, rather than 
with a transmitting medium. Nowadays, photons (light quanta) 
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are treated as one species of the fundamental particles. According 
to quantum mechanics, wave properties are associated with indi¬ 
vidual photons, in a similar way to the way ‘waves’ are associated 
with individual electrons. In both cases a wave equation can be 
used to make estimates of the probability of finding a particle 
(photon or electron) at a given point. The reader will undoubtedly 
accept that an electron and its associated ‘waves’ can traverse a 
vacuum, without a transmitting medium. Similarly, by analogy 
with an electron, it is reasonable to accept that a photon, such as 
a y- ray, and its associated ‘waves’ can cross a vacuum without a 
transmitting medium. Nowadays, a light transmitting medium, 
or ether, is generally not regarded as necessary. In the nineteenth 
century, it was believed that, like sound, light was a continuous 
wave motion in a continuous medium. It was reasonable to 
postulate a mechanical ether in such circumstances. 

2.4. MAXWELL’S EQUATIONS AND THE PRINCIPLE 
OF RELATIVITY 

The laws of classical electromagnetism, such as Coulomb’s law, 
the Biot-Savart law, Faraday’s law etc., are summarized by 
Maxwell’s equations. If he has not already met them, the reader 
will almost certainly meet these important equations in his elec¬ 
tricity course. In addition to describing the behaviour of electric 
and magnetic phenomena. Maxwell’s equations can be used to 
interpret classical physical optics in terms of electromagnetic waves. 
Now, if the co-ordinates and time are transformed using the 
Galilean transformations, eqns (1.6)—(1.9), then Maxwell’s equa¬ 
tions do not obey the principle of relativity. It was natural in the 
last quarter of the nineteenth century to assume that Newtonian 
mechanics and the Galilean transformations were correct, since, 
at that time, no experiments on very high speed atomic particles 
had been carried out, so that no significant deviations from 
Newtonian mechanics had been recognized. However, if the 
Galilean transformations were assumed to be correct, Maxwell’s 
equations could only hold in one inertial reference frame, and if 
this were true, it should have been possible to identify this absolute 
reference frame by means of electrical experiments such as the 
Trouton-Noble experiment. (See Appendix 1.) However, in 
practice, it proved impossible to identify any absolute reference 
frame for the laws of electromagnetism 

What Einstein 1 did in the theory of special relativity, was to 
postulate that the laws of optics, electricity and magnetism obeyed 
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the principle of relativity. This meant abandoning the Galilean 
transformations and the concept of absolute time in favour of the 
Lorentz transformations. In his 1905 paper entitled On the Electro¬ 
dynamics of Moving Bodies in which the theory of special relativity 
was first developed, Einstein 1 wrote 

Examples of this sort, together with the unsuccessful attempts to discover 
any motion of the earth relatively to the ‘light medium’, suggest that 
the phenomena of electrodynamics as well as of mechanics possess no 
properties corresponding to the idea of absolute rest. They suggest 
rather that, as has already been shown to the first order of small quan . 
tities, the same laws of electrodynamics and optics will be valid for all 
frames of reference for which the equations of mechanics hold good. We 
win raise this conjecture (the purport of which will hereafter be called 
the Principle of Relativity’) to the status of a postulate, and also introduce 
another postulate, which is only apparently irreconcilable with the 
former, namely, that light is always propagated in empty space with a 
definite velocity c which is independent of the state of motion of the 
emitting body. These two postulates suffice for the attainment of a 
simple and consistent theory of the electrodynamics of moving bodies 
based on Maxwell’s theory for stationary bodies. The introduction of a 
luminiferous ether’ will prove to be superfluous inasmuch as the view 
here to be developed will not require an ‘absolutely stationary space’ 
provided with special properties, nor assign a velocity-vector to a point 
of the empty space in which electromagnetic processes take place. 

Later on in the same paper, Einstein 1 wrote out his two main 
postulates explicitly, namely, the principle of relativity and the 
principle of the constancy of the velocity of light. To quote from 
Einstein’s paper: 

The following reflexions are based on the principle of relativity and the 
principle of the constancy of the velocity of light. These two principles 
we define as follows: 

1. The laws by which the states of physical systems undergo change 
are not affected, whether these changes of state be referred to the one or 
the other of two systems of co-ordinates in uniform translatory motion. 

2. Any ray of light moves in the ‘stationary’ system of co-ordinates 
with the determined velocity c, whether the ray be emitted by a stationary 
or by a moving body. 

At the time the theory of special relativity was introduced (1905) 
the generally accepted laws of optics and electromagnetism were 
Maxwell’s equations. It would have been reasonable for Einstein 1 , 
after extending the principle of relativity to optics and electro¬ 
magnetism, to assume as his second postulate, that the appropriate 
laws of optics and electromagnetism were Maxwell’s equations. 
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However, Einstein 1 chose the principle of the constancy of the 
velocity of light as his second main postulate. According to this 
postulate the numerical value of the speed of light is the same in all 
inertial reference frames. By choosing the principle of the constancy 
of the velocity of light as his second postulate Einstein 1 was able to 
analyse the experimental measurement of the times of spatially 
separated events in the way described in Section 3.8, before 
developing the Lorentz transformations. After developing the 
Lorentz transformations from the principle of the constancy of the 
velocity of light, in the same paper, Einstein 1 went on to show that, 
if the co-ordinates and time are changed according to the Lorentz 
transformations. Maxwell’s equations, that is the laws of classical 
optics and electromagnetism, obey the principle of relativity. 

For the benefit of readers familiar with Maxwell’s equations, it 
is shown in Appendix 2 that, if it is assumed that Maxwell’s 
equations are correct and obey the principle of relativity, then the 
principle of the constancy of the velocity of light follows. 

The theory of special relativity evolved from classical electro¬ 
magnetism, as can be seen from the title of Einstein’s original 
paper 1 , which was, On the Electrodynamics of Moving Bodies. Effec¬ 
tively, Einstein extended the principle of relativity to classical 
optics and electromagnetism. Following the development of the 
Lorentz transformations, and the abandonment of the concept of 
absolute time, it was necessary to modify the laws of mechanics. 
There is now a large amount of independent experimental evidence 
in favour of the new laws of relativistic mechanics. One can now 
abandon the traditional historical approach to the theory of special 
relativity, which was based on the principle of the constancy of the 
speed of light. The theory can now be approached via experiments 
on high-speed mechanics, in the way outlined in Appendix 4. Some 
readers may find it easier to appreciate the necessity for the changes 
in the classical concepts of space and time due to the theory of 
special relativity, by considering the experimental result that 
accelerated electrons tend to a limiting speed, namely the speed of 
light. This approach is discussed in Section 5.4.2 and Appendix 
3(b). When approached via high-speed mechanics, the deviations 
from Newtonian mechanics are large, and it is obvious that experi¬ 
ments in high energy physics demand the abandonment of 
Newtonian mechanics in favour of the theory of special relativity. 
When special relativity is approached in the historical way via 
optics and electromagnetism, the experimental evidence is based on 
the absence of some small second order effects, which should have 
been present, if there was an absolute reference frame for the laws 
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of optics and electromagnetism (cf. the Michelson-Morley and the 
Trouton-Noble experiments described in Appendix I). For this 
reason some readers may find the experimental necessity for the 
theory of special relativity more convincing when it is approached 
via high-speed mechanics (cf. Appendices 3(b) and 4, and Sections 
5.4.2 and 5.8.6). The historical approach using the principle of 
the constancy of the speed of light will be used in Chapter 3, as it is 
convenient to discuss the measurement of the times of spatially 
separated events in terms of light signals. 

The validity of the theory of special relativity in inertial reference 
frames depends, primarily, on the fact that the predictions of the 
theory are in agreement with the experimental results. 
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THE LORENTZ TRANSFORMATIONS 


3.1. INTRODUCTION 

Einstein’s two main postulates in the theory of special relativity 
were the principle of relativity and the principle of the constancy 
of the velocity of light 1 . 

According to the principle of relativity, the laws of physics, including 
those of mechanics, optics, electromagnetism and nuclear physics 
are the same in all inertial reference frames. 

According to the principle of the constancy of the velocity of light the 
speed of light in empty space has the same numerical value in all 
inertial reference frames. There is now direct experimental evidence 
in favour of the principle of the constancy of the velocity of light. 

It must be pointed out that there are additional assumptions 
implicit in the theory of special relativity. For example, it is 
assumed that inertial frames exist and that, in such a reference 
frame, the motion of a body is uniform and rectilinear provided no 
forces act on the body. This definition is taken over from Newtonian 
mechanics. It is also assumed that in such a reference frame light 
is propagated rectilinearly and isotropically in free space. This 
assumes that all regions of space and all directions, in space are 
equivalent. It is also assumed in the theory of special relativity 
that Euclidean geometry can be used to calculate the relationships 
between geometrical quantities. It is assumed that all time intervals 
are equivalent. The validity of these extra postulates is not known 
a priori but has to be tested experimentally. In fact, some of these 
postulates must be modified within the context of the theory of 
general relativity. The refinements due to the general theory are 
generally very small, and need only be introduced when the 
accuracy of the measurements make it necessary; they will be 
omitted from the present discussion until the theory of general 
relativity is considered in Chapter 9. For most practical purposes, 
either the laboratory system or a system at rest or moving with 
uniform velocity relative to the fixed stars is a sufficiently good 
approximation to an inertial frame satisfying the above postulates. 

Before proceeding to develop the theory, the principle of rela¬ 
tivity and the principle of the constancy of the velocity of light are 
discussed to see if the postulates are reasonable. 
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3.2. THE PRINCIPLE OF RELATIVITY 

The principle of relativity is now extended to all the laws of physics 
including the laws of optics and of electromagnetism. According 
to the principle of relativity, if an isolated system is observed from 
two different inertial frames, one moving with uniform velocity 
relative to the other, though the observations on the system carried 
out in the two inertial systems yield different numerical values for 
some quantities, the laws derived on the basis of these observations 
should have the same mathematical form in both inertial frames. 

The laws of physics should not contain any terms referring to an 
absolute system. An example of the principle of relativity was 
given in Section 1.4, where Newton’s laws of motion were shown to 
transform into Newton’s laws under the Galilean transformations. 

This was illustrated by the example of a game of tennis played on the 
deck of a ship going out to sea. Whilst some of the measurements of 
an observer on the seashore would differ from those of an observer 
on the ship (e.g. measurements of the velocity of the ball) both 
observers would agree that Newton’s laws held to a very good 
approximation in their respective rest frames, provided the velocities 
of the ship and the tennis ball were very much smaller than the 
velocity of light. Nowadays, the principle of relativity is generally 
accepted, and none of the serious alternatives to the theory of I 

special relativity suggest that it does not hold. It must be em¬ 
phasized that we can have theories, such as Newtonian mechanics, 
which satisfy the principle of relativity, but which are at variance 
with the principle of the constancy of the velocity of light. It is 
this latter principle which distinguishes the theory of special 
relativity from other relativistic theories. The principle of rela¬ 
tivity says nothing about the numerical values of quantities in 
different inertial frames. It merely states that the laws are the same. 

It is the principle of the constancy of the velocity of light, in the 
form in which we are assuming it, which makes a precise statement 
about the numerical values of measurements carried out in different 
inertial reference frames. 

As an example of the scope of the principle of relativity in the 
theory of special relativity, consider the ship going out to sea with 
uniform velocity on a calm day. If experiments in mechanics, optics, 
electromagnetism or nuclear physics were performed in a room 
inside the ship, according to the principle of relativity, the laws 
determined on the basis of these experiments would be precisely the 
same as if they were carried Out in a laboratory at rest relative' 
to the earth. Without looking at anything external to the ship, one 
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could not determine the speed of the ship by means of any of these 
experiments. 

3.3. THE PRINCIPLE OF THE CONSTANCY OF THE 
VELOCITY OF LIGHT 

In this section a brief review of some of the evidence available in 
favour of the principle of the constancy of the velocity of light will 
be given. If the velocity of light is to have the same numerical 
value in all inertial frames, then the velocity of light must be 
independent of the velocity of the source of light relative to the 
observer and must have the same value in all directions and in all 
regions of empty space. 

The generally accepted laws of classical optics and electro¬ 
magnetism are Maxwell’s equations. It is shown in Appendix 2 
that, if Maxwell’s equations are correct and obey the principle of 
relativity, then the principle of the constancy of the velocity of 
light follows. Thus, if the accepted laws of classical optics and 
electromagnetism are to obey the principle of relativity, then the 
principle of the constancy of the velocity of light and the re-inter- 
pretation of space and time the acceptance of this principle 
necessitates, must be accepted. 

For a long time, there was no direct experimental check of the 
principle of the constancy of the velocity of light. Recently, several 
experimental checks have been performed. (James and Sternberg 2 , 
Sadeh 3 , Alvager, Nilsson and Kjellman 4 , Babcock and Bergman 5 , 
and Alvager, Farley, Kjellman and Wallin 6 .) For example, it will 
be described in Section 5.8.6 how Alvager and colleagues* deter¬ 
mined the velocities of photons arising from the decay of 7r°-mesons 
into two photons each. The velocity of the ir°-mesons, estimated 
using the equations of the theory of special relativity, was 0-99975 c. 
The measured velocity of the photons arising from the decay of 
moving 7 r # -mesons was (2-9977 ± 0-0004) X 10 8 m/sec. This value 
is consistent with the accepted value of 2-9979 X 10 8 m/sec for the 
velocity of light emitted by a stationary source. This result shows 
that the velocity of light does not add to the velocity of the source 
according to the Galilean transformations, eqns (1.10), (1.11) and 
(1.12). Thus, there is now direct experimental evidence in favour 
of the principle of the constancy of the velocity of light. 

The statement that the speed of light has the same numerical 
value in all inertial reference frames implies that the fundamental 
units of length and time are defined in the same way in all inertial 
reference frames, for example, as described in Section 1.2. 
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There is an alternative approach to the theory of special relativity 
based on relativistic mechanics. This approach is outlined in 
Appendix 4. Instead of the principle of the constancy of the speed 
of light, one can start from the experimental result, described in 
Section 5.4.2, that the limiting speed for electrons, accelerated in 
an electric field, is the speed of light in empty space. If all inertial 
frames are equivalent, this limiting speed should have the same 
numerical value in all inertial frames. If this were not true, then 
it would be possible to separate the inertial frames in terms of the 
different values of the limiting speed (cf. Section 5.4.2). This 
postulate will be called the principle of the constancy of the limiting speed 
for particles. It is illustrated in Section 5.8.6, that, by treating light 
photons as particles of zero rest mass travelling at the limiting 
speed c, the principle of the constancy of the speed of light is a 
special case of the principle of the constancy of the limiting speed 
for particles. 

3.4. THE LORENTZ TRANSFORMATIONS 

It has been shown that there is direct evidence in favour of the 
principle of the constancy of the velocity of light. This principle, 
together with the principle of relativity and those relating to the 
isotropy of space, the rectilinear propagation of light in free space 
and the applicability of Euclidean geometry, will now be taken as 
axiomatic. Some of the consequences of the axioms are derived 
in Sections 3.4 and 3.5, and it is then seen if these predictions agree 
with the experimental results. No infallibility will be claimed for 
the theory of special relativity, since all theories can be replaced by 
new theories, provided the new theories are in better agreement with 
the experimental results. It must be stressed that one must be 
prepared to replace Newtonian mechanics completely. That such a 
radical change is necessary can be seen immediately by considering 
a simple example. Consider two inertial frames £ and E', with 
E moving with uniform velocity v along the common positive Ox 
axis relative to E. Let a beam of light be emitted along the common 
x axis at the instant / = t' = 0, when the origins of £ and £' 
coincide. Let the light reach a detector at a position x in E at a 
time t. Let an observer at rest in X' record this event at a position 
x at a time t . Now according to the principle of the constancy 
of the velocity of light x\t = c in £ and x'\t' = c in £', where 
c is the velocity of light. Since the origin of E' has moved relative 
to E, x' must be less than x. Hence, t' must be less than t, so 
that according to the theory of special relativity time cannot be 
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absolute and one cannot write t' = t. Thus one must be prepared 
to change one’s intuitive ideas about absolute time, ideas that were 
formed in the macroscopic world we live in and in which all 
velocities are normally very much smaller than the velocity of 
light. One cannot say at the outset what all the changes in outlook 
must be. Initially the precise meaning of the symbols used in the 
theory must remain hidden in the text until the appropriate stage 
of the theory has been developed, such that the terms can be defined 
and interpreted. It was illustrated in Section 1.3, from a con¬ 
sideration of Newtonian mechanics, how the meaning of words 
such as force and mass only take meaning in terms of a particular 
theory. One must now be prepared to re-interpret even measure¬ 
ments of length and time in terms of the theory of special relativity. 

Some readers may prefer to repeat the above arguments, in 
terms of the motion of a particle, such as an electron, moving along 
the x and x' axes of S and S' respectively, at speeds very close to the 
limiting speed c relative to both S' and S. If the electron passes 
the origins of X and S' at t = t' = 0, and is detected at (*, t) and 
(x', t') relative to S and S' respectively, then xjt — c and x It — c. 
Since the origin of S', moves relative to X, x cannot be equal to x. 
Hence, t' cannot be equal to t. This illustrates, in terms of the 
motion of a very high energy electron, that time cannot be absolute. 

According to the principle of relativity the mathematical ex¬ 
pressions for the laws of physics should have the same form in all 
inertial frames of reference. In practice one needs only one such 
frame to describe the course of physical phenomena and for most 
practical purposes the laboratory frame is chosen. It is sometimes 
more convenient to carry out the calculations relative to another 
inertial frame moving with uniform velocity relative to the first. 
The question now discussed is, if measurements are made and the 
results interpreted relative to one inertial frame, then how can the 
numerical values of the variables relative to another inertial frame, 
moving with uniform velocity relative to the first, be calculated. 
We shall start by considering how the co-ordinates and time of an 
event in one inertial frame must be related to the co-ordinates and 
time of the same event, measured in another inertial frame moving 
with uniform velocity relative to the first, if the postulates of the 
theory of special relativity are taken as axiomatic. 

It is assumed that there are two inertial frames, S and S , each 
having its own set of imaginary rulers so that observers at rest in 
either reference frame can observe the co-ordinates of events when 
they happen. As the time interval between two events in S may 
not be equal to the time interval between the same two events 

40 


~Piifi.fi. 7fla£ltc#«ta£rca/ ~Ph-y.Al.cJL 





THE LORENTZ TRANSFORMATIONS 

measured in 27', one clock reading absolute time is not adequate, 
so it is imagined that two sets of clocks are distributed throughout 
space, one set of clocks at rest relative to 27, the other set at rest 
relative to 27'. All the clocks at rest in 27 are assumed to be synchro¬ 
nized with each other and all the clocks at rest in 27' are assumed to 
be synchronized with each other. (See Section 3.8 for a discussion of 
how the clocks may be synchronized.) It is assumed that all the 
clocks keep good time and, once synchronized, remain synchronized. 
[Some readers may prefer to visualize the determination of the 
co-ordinates and times of distant events using radar methods. Such 
readers can proceed first to Section 3.9 for a discussion of radar 
methods, then to Appendix 6 for a derivation of the Lorentz 
transformations and then proceed from eqn (3.7) of this Section.] 

Let the zero of time in both 27 and 27' be chosen to be the instant 
when the origins of 27 and 27' coincide (at t = t' = 0). Let 27' move 
with uniform velocity v along the common x and x' axes and let the 
Oy and O'y' and the Oz and O'z' axes coincide at t = t' — 0. It 
must be stressed that it is being assumed that the inertial frames have 
been moving and continue to move with uniform velocity relative 
to each other for all time, so that there is no acceleration of one 
reference frame relative to the other. A discussion of the possible 
effects of the acceleration of one reference frame relative to the other 
is deferred until Chapter 8, in which the clock paradox is discussed. 
It is also assumed that the measuring instruments used in 27' are 
not constructed in 27 and then accelerated into 27', but that observers 
in 27 and E' build their own apparatus from materials they find 
around them, using the same units to calibrate them as discussed in 
Section 1.2. This method overcomes any difficulties arising from 
possible effects due to the acceleration of measuring instruments. 

Consider a beam of light emitted at the instant when the origins 
of 27 and 27' coincide at t — t' = 0, as shown in Figure 3.1(a). Let 
the light reach a detector at the point P. Let this event be measured 
using the rulers and clocks at rest in 27 and let it be recorded to be at 
the position x, y, z at a time t. Assuming that light is propagated 
rectilinearly in the inertial frame 27, then an observer at rest in 27 
would say that the light travelled along the path OP, such that 

OP V x 2 + y 2 + -z 2 

_ = - = c 

or x 2 + y 2 + z 2 — c 2 t 2 = 0 (3.1) 

where c is the velocity of light. 
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Figure 3.1. (a) A beam of light is emitted when the origins of 2 and 2' coincide at t — 
t' = 0 . This light is detected at a point P at a later time. (b) The light appears to go 
along the path OP relative to 2, and along the path O'P relative to 2' 


An observer at rest in S’ would agree that the light did reach the 
detector (which may be moving relative to both S and S'). Using 
rulers and clocks stationary in S', let the observer at rest in S' 
record this event at a position x', y', z! at a time t'. Relative to S' 
the light would appear to have travelled the path O'P as shown in 
Figure 3.1(b) such that 


Q'p _ Vx'* + /• + z' 2 


(3.2) 


or 

x' 2 + y' 2 + z' 2 - cH' z = 0 (3.3) 


Notice that the same value was used for the velocity of light in both 
S and S', so as to be in accord with the principle of the constancy 
of the velocity of light. The co-ordinates x, y, z and t in S, and 
x’, y', z' and t' in S' refer to the same event, namely the detection of 
the light at P in Figure 3.1(b). The appropriate co-ordinate trans¬ 
formations must transform eqn (3.3) into eqn (3.1). Direct sub¬ 
stitution in eqns (3.1) and (3.3) will confirm that the correct 
transformations are 


_ (* ~ v Jl 

X Vl — v 2 jc 2 
/ =y 

z' — z 


= y(x — vt) 


(t — vxjc 2 ) 

Vl - v z /c z 
42 


= y(t — vxjc 2 ) 


(3.4) 

(3.5) 

(3.6) 

(3.7) 
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These are the Lorentz transformations. A derivation is given in 
Appendix 3(a). An alternative derivation of the Lorentz trans¬ 
formations, also based on the principle of the constancy of the speed 
of light, but using radar methods to determine the co-ordinates and 
times of events is given in Appendix 6. Some readers may find 
these radar methods easier to follow than using imaginary rulers 
and synchronized clocks distributed throughout space 

Alternatively, the Lorentz transformations may be developed 
from the experimental result, described in Section 5.4.2, that the 
speed of light in empty space is the limiting speed for electrons. A 
short proof is given in Appendix 3(b). 

The inverse transformations are 


(*' + vt') 

Vl - v 2 /c 2 


= y(V + vt’) 


y =/ 

z — z' 


(f + vx'jc 2 ) 
Vl - v 2 jc 2 


= y{t' + vx'/c 2 ) 


(3.8) 

(3.9) 

(3.10) 

(3.11) 


Notice that the inverse transformations can be obtained from eqns 
(3.4)-(3.7) by interchanging primed and unprimed quantities 
and replacing v by — v. (The inverses of all relativistic transforma¬ 
tions can be obtained by this procedure.) Eqns (3.4)-(3.7) are 
generally known as the Lorentz transformations. From time to 
time we shall use the abbreviation 

y = VT^Pp ( 3-12 ) 

where v is the velocity of one inertial frame relative to the other. 
Some writers use /? for 1/(1 — zj 2 /c 2 )L However, the symbol f} is 
used extensively for the ratio of the velocity of a particle to the 
velocity of light, particularly in high energy nuclear physics where 
the theory of special relativity is used extensively. For this reason 
we shall only use the symbol y for l/( 1 — p 2 /c 2 )*. It must be stressed 
that the Lorentz transformations give the relations between the 
co-ordinates and time of the same event measured in E and E' 
respectively. By an event is meant something that happens inde¬ 
pendently of the choice of any particular co-ordinate system, that is, 
something which observers in all inertial frames will agree happens, 
such as the collision of two particles. Observers in all frames of 
reference will agree that a collision between two particles takes place. 

43 


~Pufuc. Tltat/tcwtatica/ Pltylid 




THE LORENTZ TRANSFORMATIONS 

Eqns (3.1) and (3.3) relate to the co-ordinates and time of the 
event of detection of the light at the point P shown in Figure 3.1. 
The light could be detected by a photomultiplier moving relative 
to both E and E'. Observers in E and E' would agree that the light 
gave rise to an electrical impulse in the photomultiplier but they 
would attribute different co-ordinates and times to the event. 

The calculation of y can often be simplified by using a trigono¬ 
metrical method. Let vjc = sin 0. Then, 

1 1 

' Vl - sin 2 0 cos 6 

For example, if v/c = 0-8000, using four figure tables one finds that 
0 is 53°8' and sec 0 is 1 -6668. By direct calculation y is 

If v < c, then y — (1 — p 2 /c 2 ) _i is approximately unity and the 
Lorentz transformations reduce to the Galilean transformations, 
provided vxjc 2 <S; t. 

So far only one inertial frame, E', has been considered, moving 
with uniform velocity v relative to E. Now consider another co¬ 
ordinate system E" moving with uniform velocity w relative to E' 
along the common a: axis; E" is also an inertial frame. According 
to the Lorentz transformations 


(*' - “>*') (t'-wx’lc*) , ox 

Vi - w 2 /c 2 ’ y ~ y ’ ’ Vi - w*!c* ( ■ } 


If E' moves with uniform velocity v relative to E along the common 
x axis, one has 


, (* - vt ) 

* Vl - v 2 lc 2 ’ y 

Substituting for x',y', z' 
obtains 


■■y, 


z = z: 


t’ = 


(t — vx/c 2 ) 

V1 — V 2 jc 2 


(3.14) 


and t' from eqns (3.14) into eqns (3.13) one 


(x — vt) — w(t — vxjc 2 ) 
V(1 — v 2 jc 2 )( 1 — w 2 jc 2 ) 


x(\ + vw/c 2 ) — t(w + v) 

V(1 — v 2 jc 2 ) ( 1 — w 2 lc 2 ) 

Dividing top and bottom by (1 + viv/c 2 ): 

„ X — [( w + v)l( 1 + wvlc 2 )]t 

X ~ V[(l - v 2 lc 2 ){ 1 - w 2 /c 2 )-\l( 1 + wvjc 2 ) 2 
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Now the square of the denominator is equal to 

1 — v 2 /c 2 — w 2 /c 2 -f W 2 V 2 jc 4 

(1 + wvjc 2 ) 2 ~ 

(1 + wvjc 2 ) 2 - {{v 2 jc 2 ) + {w 2 jc 2 ) + (2 w vjc 2 )} _ ( W + V )2 

(1 + wv/c 2 ) 2 ~ 1 — C 2 { 1 + wvlc 2 ) 2 

Hence 

r * = x ~ Vt 
Vl - V 2 /c 2 

where 


Similarly, 


y== _w±v_ 
1 + wvjc 2 


■-r, z 


z and t" 


t - Vxjc 2 
Vl - V 2 jc 2 


Therefore two successive Lorentz transformations in the same 
direction are equivalent to a single Lorentz transformation, but the 
relative velocity of the origins of S and E" is equal to 


(w + v)l(l -f- wv/c 2 ) 

and not w + v, as would be the case if the Galilean velocity trans¬ 
formations were correct. If there were a particle at rest at the origin 
of E", then it would be moving with velocity w relative to E' and 
velocity (w + v)j{\ + wvjc 2 ) relative to E, hence a velocity u’ x 
along the x' axis of E’ is expected to transform into a velocity equal 
to 04 + »)/(1 + vu' x l<; 2 ) relative to E. 

In this section the Lorentz transformations were derived from the 
principle of relativity and the principle of the constancy of the 
velocity of light. Some of the implications of the Lorentz trans¬ 
formations will now be discussed. 


3.5. SOME MATHEMATICAL CONSEQUENCES OF THE 
LORENTZ TRANSFORMATIONS 

Now, without stopping to think too much at present about the 
physical implications of the theory, and leaving aside the precise 
meaning of the symbols, some of the mathematical consequences 
of the Lorentz transformations are worked out and their implica¬ 
tions are then considered in Sections 3.6, 3.7 and 3.10. 

3.5.1. Relativity of Simultaneity of Events 

Let two events occur at two separated points x x and x 2 in the 
inertial frame E. Let them be measured to occur at the same time 
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t in E. According to the Lorentz transformations these would be 
recorded at times t[ and t 2 by clocks at rest in E', where t x and t 2 
are given by 

t[ = y ["* — and h ~ Y 1 ~ *2 

Since x x is not equal to x 2 , t[ cannot be equal to t 2 , so that if the 
Lorentz transformations are correct, then two spatially separated 
events which are simultaneous in E, would not be measured to be 
simultaneous in E'. Similarly, if two events occur simultaneously 
at two spatially separated points x( and x' 2 in E', according to the 
Lorentz transformations they would not be simultaneous in E. 
Thus, according to the theory of special relativity the simultaneity 
of spatially separated events is not an absolute property as it was 
assumed to be in Newtonian mechanics. 


3.5.2. Time Dilation 

Let a clock at rest at the point * in E, as shown in Figure 3.2(a), 
give out signals, at times t x and t 2 , such that the interval between the 



(a) (b) 

Figure 3.2. (a) In 2 the two events occur at the same point x. ( b) In 2’ they do not occur 
* at the same point 


timw of emission of the signals measured in E is given by 

A t = t 2 - t x (3.15) 

Let the co-ordinates and times of these two events relative to E' 
be x[ and x' 2 and t[ and t 2 respectively, as shown in Figure 3.2(b). 
The time interval between the events in E' is equal to 

Af' = t 2 -t( 

Using the Lorentz transformations, 
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At V(1 - v*/c 2 ) ^ 3 ' 16 ^ 

According to the theory of special relativity the time interval 
between the two events should be longer in Z' than in Z. This is 
the phenomenon of time dilation. Notice that only one clock at the 
point a; was necessary to measure the time interval A t in Z, whereas, 
since x{ == y(x - vtj is not equal to x 2 = y(x — vt 2 ), two clocks at 
x i fnd *2 were necessary to measure the time interval A t' in Z'. 
This brings in the problem of synchronizing spatially separated 
clocks in Z'. This point is discussed in detail in Section 3.8. The 
time interval between two events taking place at the same point 
in an inertial frame and measured by one clock stationary at that 
point is called the proper time interval between the two events. If 
the theory of special relativity is correct, it necessitates a revision 
of the Newtonian concept of absolute time, though if v then 
At' At. Hence the deviations from absolute time are very small 
in our normal daily lives. 

If signals were given out at times t[ and t' 2 by a clock at rest at a 
point x in Z ', then in Z these events would be recorded at times 
f x and t 2 , such that 


4 h — y[t 2 ~h vx'/c 2 ] — y[t'i + vx'jc 2 ] 
or 


At = 


At' 

V 1 - v 2 jc 2 


(3.17) 


In this case At is the proper time interval and is measured by one 
clock, whereas At is now an improper time interval and has to be 
measured by two separated clocks in Z. Comparing eqn (3.16) 
with eqn (3.17) it will be seen that, according to the Lorentz 
transformations, proper time intervals in either Z or Z' should be 
measured to be dilated by a factor y in the other inertial frame. 


3.5.3. The Lorentz—FitzGerald Contraction 

In order to measure the length of a stationary object, in principle, 
one could lay the object on a ruler and then look successively at the 
positions of each end of the object. If the object were moving, then 
both ends would have to be measured at the same time. This point 
is brought out vividly by an example given by Lieber and Lieber 7 
and illustrated in Figure 3.3. If a fish is swimming with velocity v 
relative to the laboratory frame (which will be identified with Z), 
one would not measure the position of the fish’s tail at the point B 
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and then wait until it had swum until its mouth was opposite the 
point C before measuring the position of the other end of the fish. 
Not even the most fervent angler would stoop to that but would 
insist that one should measure both ends of the moving fish at the 
same time relative to the laboratory. The reference frame moving 
with the fish will be identified with S'. 

Let the positions of the ends of the fish be measured at the same 
time t in S and recorded to be at the points x 1 and x 2 as shown in 
Figure 3.3(a). The fish would record the events associated with the 



(a) ( t >) 

Figure 3.3. (a) The measurement of the length of a fish which is swimming with uniform 
velocity v relative to Z. ( b) In the inertial frame Z', the fish is at rest 

measurement of its length at positions x{ — y[x l — vt\ and x' 2 = 
y[x 2 — vt] at times t[ = y\t — vxjc 2 ] and t z = y[t — vxjc 2 ] re¬ 
spectively. As the fish does not move in S', its extremities can be 
measured at any time in S' so that, even though t[ is not equal to 4 
the length of the fish measured in S' is equal to x' z — x(. Hence, 

*2 — x( = y(x 2 — *i) 
or _ 

(* 2 - x i) = VI - » 2 /e a 04 - *[) 

Hence, 

/ = Z 0 Vl - (3.18) 

where l 0 — (x' 2 — x() is the length of the fish measured in the co¬ 
ordinate system in which it is at rest; l 0 is called the proper length 
of the fish, whilst l = (x 2 — * x ) is the length of the fish measured in 
an inertial reference frame relative to which the fish is moving with 
uniform velocity v. This is the phenomenon of length contraction. 
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Since t' 2 is not equal to t[ the fish would ‘say’ that according to 
‘its’ clocks the measurements were not carried out at the same time. 
For example, let two observers, at rest in the laboratory system 
carry out the actual experiment of measuring the length of the fish 
by noting, at the same time t, the positions of the front and rear 
of the fish on a ruler, which is stationary in the laboratory system. 
The fish would ‘claim’ that, relative to the reference frame S' in 
which it was at rest, one observer measured the front of its nose at 
a time t' 2 , and the ruler was then moved with velocity —v until a 
later time t[, when the second observer measured the position of its 
tail. The fish would ‘say’, ‘No wonder the laboratory observers 
measured my length to be too short. They moved the ruler’. It can 
be seen that the measurement of length involves the question of 
the simultaneity of spatially separated events. 

Since according to the Lorentz transformations y — y' and z = z', 
the dimensions of moving bodies perpendicular to their direction 
of motion are not measured to be Lorentz contracted. 

The phenomenon of length contraction is reciprocal. If the fish 
measured the length of an object, which was stationary in the 
laboratory, at the same time t' in S' and found the extremities 
of the object to be at the points x' 2 and x{, then, according to the 
Lorentz transformations, one would have 

*2 — *1 = 7l> 2 + »*'] — yi x l + vt '] 
or 

*2 — *1 = V'l — V 2 jc 2 (x 2 — x x ) 

Hence, in general, the measured length of an object is less in its 
direction of motion, when the object is moving relative to an 
observer,, compared with the length of the object measured by an 
observer at rest relative to the object. Notice it was said that a 
moving object is measured to contract in the direction of motion, in 
the sense that one observes the positions of the ends of its extremities 
on a ruler at the same time. What one sees (or photographs) 
depends on the light actually reaching the eye (or camera) at that 
instant, and the light from different parts of the object will have left 
the object at different times depending on the distances of the 
various parts of the object from the observer. What one sees depends 
also on the Doppler effect. These effects are discussed in Section 4.4. 

Let a body of proper volume V 0 move with uniform velocity v 
relative to S. If it is divided into thin rods parallel to v, since each 
one of these is reduced in length by an amount given by eqn (3.18), 
the measured total volume of the body relative to S should be 

V = V 0 Vl - v 2 lc 2 (3.19) 
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It has been shown in this section that if the theory of special 
relativity is correct then it necessitates the abandonment of the 
concepts of absolute length and absolute time. The above results 
are developed using the AT-calculus in Appendix 6. 

3.6. THE SIMULTANEITY OF SPATIALLY SEPARATED 
EVENTS 

The criterion for accepting a theory is whether or not the predictions 
of the theory are in agreement with the experimental results. 
In this section, it is discussed qualitatively whether the predictions 
of the theory of special relativity about length contraction and time 
dilation are plausible and in accord with experience. Before lengths 
can be measured and spatially separated clocks synchronized, one 
must decide what is meant by the simultaneity of spatially sepa¬ 
rated events. 

It would be easy to say which of two events took place first, if the 
events occurred at the same place. If the events were separated in 
space, then signals would have to be sent out when the events hap¬ 
pened. These signals would reach the observer at a later time. If 
the observer knew the speed of propagation of the signals and the 
positions of the events, then he could estimate when the events 
took place. Ideally, if signals could be transmitted with infinite 
velocity, such that they would reach the observer without time 
delay, then the observer need only note the time of arrival of the 
signals and equate it to the time of the event. However, there is 
no known way of transmitting signals instantaneously; all actual 
signals take a finite time to go from the event to the observer. The 
fastest signals available are light signals. It will be assumed, in the 
present discussion, that light signals are emitted when events 
happen. The light signals then travel to the observer who detects 
the signals with a light detector, such as a photomultiplier. If the 
observer records the signal at a time t and measures that the event 
was a distance r away from him, then he would estimate that the 
event took place at a time t — r[c, where c is the velocity of light. 

In order to overcome the necessity of measuring time intervals 
Einstein 8 considered an example in which the light signals from 
two events were observed at a point midway between the events. 
If the events occurred at A and B respectively and emitted light 
signals when they occurred, and if the light signals from A and B 
reached a point C half way between A and B at the same time, then 
it would be reasonable to assume that the events occurred simul¬ 
taneously at a time AB/2c before the light signals reached C. This 
is a satisfactory definition of simultaneity in this particular case, 

50 





THE SIMULTANEITY OF SPATIALLY SEPARATED EVENTS 


provided the velocity of light is the same in both directions and 
provided light travels in straight lines. 

Let two rockets denoted 1 and 2 pass each other. Let both rockets 
move with uniform velocity relative to the fixed stars so that the 
two co-ordinate systems, in which one or other of the rockets is 
at rest, are both inertial systems. Let the relative velocity of the 
rockets be v. The reference system in which rocket 1 is at rest is 
identified with £ and the inertial system in which rocket 2 is at rest 
is identified with £'. It is assumed that there are observers in both 
rockets and that these observers have devices, such as photomulti¬ 
pliers, to detect light signals coming from different directions and 
also measuring rods to measure distances. 

It is assumed that two lightning thunderbolts strike both rockets 
just as the rockets are passing each other. Let the thunderbolts 
leave marks on both rockets. These are two events independent 
of any co-ordinate system, since the observers on both rockets would 
agree that thunderbolts did strike on two occasions. Let the 
positions of the marks caused by the lightning on rockets 1 and 2 
be denoted by A and A' and B and B' respectively, and let the light 
detectors on rockets 1 and 2 be at C and C' respectively, as shown in 
Figure 3.4. It will be assumed that each observer finds afterwards 
that his light detectors were exactly half way between the marks 
left on his rocket, that is AC = CB on rocket 1 and A'C' = C'B' on 
rocket 2. The measuring sticks need not even be calibrated in the 
same units to show these equalities. The length measurements can 
be carried out by the observers on the rockets at their leisure, by 
walking along the rockets. 

It is assumed that the light signals from A and B arrive at the 
same time at the detectors at C as shown in Figure 3.4(c). If the 
observer on rocket 1 assumes that the velocity of light in empty 
space is independent of the velocity of the source and is the same 
in all directions in space, as predicted by Maxwell’s equations, 
and the principle of the constancy of the velocity of light, then he 
would conclude that the two lightning flashes struck simultaneously 
at A and B at a time ACjc before the light signals reach C. 

At the instant the lightning strikes at A, then A coincides with A’, 
and when it strikes at B, then B coincides with B'. Since the light¬ 
ning strikes at the same time at A and B relative to rocket 1, and 
since AC = CB and A'C' = C'B', it is reasonable to assume that, 
relative to £, C coincides with C' when the lightning strikes at A 
and B as shown in Figure 3.4(a), provided all times are measured 
relative to rocket 1 (£). The light emitted by the lightning takes 
a finite time to reach the detectors at C and C', and during this 
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( The co-ordinate system in which\ 
rocket 1 is at rest and in which j 
rocket 2 is moving with velocity vj 



Figure 3.4. (a) Two rockets labelled 1 and 2 are passing each other when lightning 
strikes twice leaving marks on both rockets. The lightning strikes simultaneously in the 
co-ordinate system E in which rocket 1 is at rest, such that the light signals from A, A' 
and B, B' reach the light detector at C, situated halfway between A and B, simultan¬ 
eously as shown in ( c ). The light signal from A, A' passes C' before the light from 
B, B’ does, as illustrated in (b) and (d) respectively 

time interval rocket 2 moves relative to rocket 1 such that C' moves 
towards A, as illustrated in Figure 3.4(b). Thus the light travelling 
from A towards C must pass C’ before reaching C, as shown in 
Figure 3.4(b). A little later, relative to E, the light from the flashes 
at A and B reach C simultaneously, as shown in Figure 3.4(c), so 
that the observer on rocket 1 concludes that the lightning struck 
at the same time at A and B. Notice the light from B and B’ does 
not reach C' until after it has passed C as shown in Figure 3.4(d). 
As a result the observer on rocket 2 would record the signals from 
A and A’ before the signals from B and B', say at times t' and 
t' + At' respectively. The observer on rocket 2 measures A'C' 
to be equal to C'B'. If he assumes that the velocity of light is the 
same in both directions, and he is just as entitled to do so as the 
observer on rocket 1, then he estimates that the events at A' and 
B' occurred at times t' — A'C' jc' and t' + A t' — B'C'\c' respectively, 
where c' is the velocity of light relative to rocket 2. The observer 
on rocket 2 would conclude that the events did not occur simul¬ 
taneously in his reference frame, and would conclude that the event 
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A happened before the event B'. Both observers use exactly the 
same, criterion for the simultaneity of spatially separated events. It 
is quite possible for the lightning to strike such that the light sig na ls 
reach C simultaneously, in which case the observer on rocket 2 
would record the events as simultaneous; but in this case the light 
from B will pass C before reaching C', whilst the light from A reaches 
C at a later time so that the observer on rocket 1 would record that 
the lightning struck at B before A. When the problem of the 
simultaneity of spatially separated events is related to measurements 
that can actually be performed, then, if the velocity of light in empty 
space is independent of the velocity of the source relative to the 
observer, and is the same in all directions, it can then be seen that 
events measured to be simultaneous in one inertial frame are not 
measured as simultaneous in another inertial frame moving with 
uniform velocity relative to the first. 

Notice that in Figure 3.4 the relative velocity of the two rockets 
was comparable with the speed of light. Normally rockets go rather 
slower than this so that the differences from absolute simultaneity 
would normally be too small to be noticed in our daily lives. 

The Lorentz transformations can be used to fill in the mathe¬ 
matical background to the above qualitative discussion. Let the 
event of the lightning’s striking at A, A' be recorded at (x l5 f x ) 
relative to 27, the inertial reference frame shown in Figure 3.4 and in 
which rocket 1 is at rest. Let the same event at A, A' be recorded 
at.(xi, < x ) relative to 27', the inertial reference frame in which rocket 
2 is at rest as shown later in Figure 3.5. Let the event at B, B' be 
recorded at (x 2 , t 2 ) and (x 2 , t 2 ) relative to 27 and 27' respectively. 
Applying the Lorentz transformations, 

t'z = y(h — vx 2 /c 2 ) 

and 

*x = V(h — vxjc 2 ) 

If the events are simultaneous relative to 27, t x = t 2 . Hence, 

K — K = 7 v ( x i ~ x a )lc 2 

In Figure 3.4(a), x t > x 2 so that t 2 is greater than t[. Hence relative 
to the inertial frame 27' in which rocket 2 is at rest, the event at 
B, B' should be measured to occur after the event at A, A', as 
predicted previously by our qualitative discussion. 

It will be assumed that the lightning marks are at the ends of the 
rockets. Relative to 27, rocket 2 is moving with uniform velocity v. 
According to the Lorentz transformations, relative to 27, rocket 2 
should be measured to be Lorentz contracted such that its measured 
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Figure 3.5. Co-ordinate system S' in which rocket 2 is at rest, rocket 1 moves to the left 
with uniform velocity v. (a) Since the light signal from A, A' reaches C before the 
light signal from B, B', it is calculated that the lightning struck at A, A' before B, 
B' relative to S'. ( b) Since rocket l moves relative to S' in the time interval between 
these two events, it is concluded that in S', AB is less than A'B'. In S AB is equal to 
A'B' as shown in Figure 3.4. 


length, relative to S, is (Z 0 ) 2 V 1 — v 2 lc 2 , where (Z 0 ) 2 is the proper 
length of rocket 2 measured in the inertial reference frame S' in 
which it is at rest. Since rocket 1 is at rest relative to S, its measured 
length is its proper length (Z 0 ) v Relative to S, Figure 3.4(a), the 
measured lengths of the rockets are the same, since, relative to S, A 
coincides with A' when B coincides with B'. 

Hence 

(Z*)i = (Z,),Vl - v 2 /c 2 = ( l 0 )Jy (3.20) 

It is of interest to note that (Z„) 2 the proper length of rocket 2 is 
greater than (Z^ the proper length of rocket 1. 

3.7. THE LENGTH OF A MOVING OBJECT 

As an example of the measurement of the length of a moving object 
in a direction parallel to its direction of motion, the measurement 
of the lengths of rockets 1 and 2 in Figures 3.4 and 3.5 will be 
considered. It will be assumed that the lightning marks are at the 
ends of the rockets. 

It was assumed in Figure 3.4, that the lightning strikes simul¬ 
taneously in the co-ordinate system S in which rocket 1 is at rest, 
such that the light signals reach C at the same time as shown in 
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Figure 3.4(c). An observer in rocket 1 will conclude that A coincided 
with A' when B coincided with B', so that, relative to rocket 1 
(E), the measured lengths of the two rockets are the same. It was 
pointed out in Section 3.6 that according to the Lorentz trans¬ 
formations (Z 0 ) 2 the proper length of rocket 2 is greater than (Z 0 ) 1 
the proper length of rocket 1. The measured lengths of the rockets 
are the same relative to 27 (Figure 3.4(a)), since rocket 2 is Lorentz 
contracted, whereas rocket 1 is not. In Section 3.6, we found 

(^o)i = (l a )iV 1 - v 2 /c 2 = (/ 0 ) 2 /y (3.20) 

It was shown in Section 3.6 that, if the light signals reach C simul¬ 
taneously, as shown in Figure 3.4(c), then the observer on rocket 2 
would conclude that A' coincided with A before B' coincided with 
B, since the light from A' reaches C' (at a time t') before the light 
from B’ (which arrives at C' at a time t' + At'). Consider the 
co-ordinate system 27' in which rocket 2 is at rest. In this co¬ 
ordinate system, rocket 1 moves with velocity — v as shown in 
Figure 3.5. The observer at rest on rocket 2 would record A' 
coinciding with A at a time t' — A'C'/c, as shown in Figure 3.5(a), 
and would record B' coinciding with the mark B on rocket 1 at a 
later time t' + At' — A’C'/c. During the time interval between 
the events at A' and B' in 27', according to the observer on rocket 
2, A would have moved relative to A' such that at the time 
t' + At' — A'C'/c, the distance between the mark A on rocket 1 
and mark B on rocket 1 is less than A'B' (measured in 27') as shown 
in Figure 3.5(b). Thus the observer on rocket 2 would conclude that 
in 27', the measured length of the moving rocket, number 1, is less 
than the length of the stationary rocket, number 2 in this instance. 
On the other hand, it was shown that an observer on rocket 1 
would conclude that, relative to 27, the measured lengths of the 
rockets are the same, as shown in Figure 3.4(a). It can be seen 
that the different measures of length are intimately connected with 
the lack of absolute simultaneity. The effects were exaggerated by 
making the relative velocity of the rockets comparable with the 
speed of light. 

The Lorentz transformations will now be used to fill in the 
mathematical background to the above qualitative discussion. In 
27, rocket 2 is at rest as shown in Figure 3.5, and its measured length 
A'B' is its proper length (/ 0 ) 2 = x[ - Relative to 27', rocket 1, 
which has a proper length (lg)i, is moving with uniform velocity — v 
and, according to the Lorentz transformations, should be Lorentz 
contracted such that its measured length l( relative to 27' is 

l i = VoW 1 — » 2 /c a 
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Using eqn (3.20) 

l[ = (/,),(1 - v 2 h 2 ) (3-21) 

As in Section 3.6, let the event at A, A' be at (x l5 b) and (x v b) 
relative to 27 and 27' respectively, and let the event at B, B' be at 
(x 2 , t 2 ) and (x 2 , t 2 ) relative to 27 and 27' respectively, where b = t 2 . 
Since b = t 2 and x 1 — x 2 = (/ 0 ) ls from the Lorentz transformations, 
we have 

t 2 — t[ = yv{x 1 — x 2 )lc 2 = yv(l 0 )il^ 

Using eqn (3.20) 

t 2 — = v(l 0 )Jc 2 

Relative to 27', rocket 1 moves with uniform velocity — v for the 
time interval (t 2 —- t[) between the events at A, A' and B, B'. 
Relative to 27', Figure 3.5, the measured length of rocket 1 should be 
equal to A'B' minus the distance travelled by the rocket in the 
time interval (t 2 — t(), that is 

W — (^ 0)2 — — b.) — (^ 0)2 &2 (^o )2 l° 2 

This is in agreement with eqn (3.21). It can be seen that the 
Lorentz transformations fill in the mathematical background to 
the qualitative discussions given in Sections 3.6 and 3.7 and give a 
consistent account of the measurement of the times of the events 
and the lengths of the rockets relative to both 27 and 27'. 

The measurement of lengths perpendicular to the direction of 
relative motion will now be considered, using the example of the 
rockets. Let the lightning now strike at points A and A’ and B and 
B' at the edges of the rockets, when the rockets are directly above 
each other, as shown in Figure 3.6(a), leaving marks on both rockets. 
Let the observer on rocket 1 find that his light detectors at C were 
half way between the marks at A and B on rocket 1, and let the 
observer on rocket 2 find that his light detectors at C' were half way 
between the marks at A' and B' on rocket 2. Let AB be perpen¬ 
dicular to the direction of motion of rocket 2 as shown in Figure 3.6. 
Let the lightning strike simultaneously relative to rocket 1, such 
that the light signals reach the detectors at C at the same time as 
shown in Figure 3.6(b). The observer on rocket 1 will conclude that 
A coincided with A' at the same instant as B coincided with B'. 
Now in the co-ordinate system 27 in which rocket 1 is at rest, C' 
moves in a direction perpendicular to AB so that, whatever the 
position of C' relative to rocket 1, AC" = BC' as shown in Figure 
3.6(c). If space is isotropic and if the velocity of light is the same in 
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all directions, as is assumed in the theory of special relativity, then 
the light from A and B should reach C' at the same time relative to 
rocket 1. The coincidence of the light from both A and B with the 
detectors at C' is an event independent of any particular reference 
frame, so that the light from A and A' and B and B' must be 


(a) 


(b) Later 



A I A’ 



B 1 8' 

The light signals reach 
C simultaneously 


Rocket 2 moves with 
velocity V relative to 
rocketl; vis perpendicular 
to 4fi 



B B' 

The light signals reach 
C' simultaneously 


Figure 3.6. In this case the lightning strikes at the sides of rockets 1 and 2 when they 
are directly above each other, such that relative to 2, the co-ordinate system in which rocket 1 
is at rest, the lightning strikes simultaneously, so that the light signals from A, A’ and B 
B reach C, which is half way between A and B, simultaneously as shown in (b) In this 
case the light signals from A, A' and B, B' also reach C' which is halfway between A' 
and B on rocket 2, simultaneously as shown in (c), so that it is calculated that the lightning 
also strikes simultaneously relative to rocket 2 

measured to arrive at C' simultaneously relative to rocket 2. Since 
the observer on rocket 2 measures A’C' to be equal to B'C', he will 
deduce that A' coincided with A at the same instant as B' coincided 
with B. Thus the observers on both rockets 1 and 2 would agree, 
in this case, that A coincided with A' when B coincided with B\ 
If they chose the same units of length, they would agree that the 
length AB measured in 27 (rocket 1 at rest) was equal to the length 
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A'B' measured in 27' (rocket 2 at rest), or, in general. 



in agreement with the Lorentz transformations. 

It has been shown that, according to the theory of special rela¬ 
tivity, the measured length of an object is reduced in the direction of 
relative motion when the object is moving relative to an observer 
measuring its length. This ‘contraction’ arises naturally if simul¬ 
taneity is not absolute. On the old ether theories the Lorentz- 
FitzGerald contraction was attributed to motion of the body 
relative to the ether. According to these theories a body at rest 
on the earth should contract due to motion relative to the ether, 
which was considered an absolute frame of reference for electro¬ 
magnetic phenomena. On the other hand, according to these 
theories, a body at rest relative to the ether, but moving relative to 
the earth, should not be contracted as seen from the earth. Accord¬ 
ing to the theory of special relativity the change in measured length 
is associated with the relative motion between the object and the 
observer. The effect is perfectly reciprocal, and is due only to the 
relative motion between object and observer and not due to motion 
relative to any particular absolute reference frame. It is often asked 
whether the length contraction is ‘real’. What the principle of 
relativity says is that the laws of physics are the same in all inertial 
frames, but the actual measures of particular quantities may be 
different in different co-ordinate systems. For example, in the 
example of a game of tennis on board a ship going out to sea, 
discussed in Section 1.4, it was reasonable within the context of 
Newtonian mechanics to find that the velocity of the tennis ball 
was different relative to the ship than relative to the beach. Is 
this change of velocity ‘real’ ? According to the theory of special 
relativity, not only the measures of the velocity of the ball relative 
to the ship and the seashore will be different, but the measures of 
the dimensions of the tennis court parallel to the direction of relative 
motion and the measures of the time of events will also be different. 
Both the reference frames at rest relative to the ship and the beach 
can be used to describe the course of the game and the laws of 
physics will be the same in both systems, but the measures of 
certain quantities will differ. 

In Newtonian mechanics it was possible to make statements such 
as: ‘the length of the rod is 30 cm.’ Since, in the theory of special 
relativity, the length of a body is not absolute, such a statement has 
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no meaning unless the length is specified relative to some standard 
of rest. 


3.8. SYNCHRONIZATION OF SPATIALLY SEPARATED 
CLOCKS 

In Section 3.6, simultaneity was only defined for a particularly 
simple case in which no actual measurement of time was involved. 
For purposes of discussion it was imagined previously that a series 
of clocks was distributed throughout space so that the time of events 
could be measured on the spot when the events happened. It was 
assumed that these clocks were good clocks and kept good time; 
but the question arises, how can such clocks be synchronized ? 
One could suggest taking a standard clock around to each of them 
in turn and setting each clock in turn, but one cannot be sure that 
the standard clock would be unaffected by the transportation. It 
was pointed out in Section 1.3 that certain quantities have to be 
defined in terms of a particular theory, for example in Section 1.3 
force and mass were defined in terms of Newton’s laws of motion. 
In the theory of special relativity, in addition to force and mass, 
it is necessary to define how to synchronize spatially separated 
clocks in a way consistent with the theory of special relativity. The 
definition of distant simultaneity chosen by Einstein 1 was as follows: 

If at the point A of space there is a clock, an observer at A can determine 
the time values of events in the immediate proximity of A by finding the 
positions of the hands which are simultaneous with these events. If there 
is at the point B of space another clock in all respects resembling the one 
at A, it is possible for an observer at B to determine the time values of 
events in the immediate neighbourhood of B. But it is not possible 
without further assumption to compare, in respect of time, an event at A 
with an event at B. We have so far only defined an ‘A time’ and a 
‘B time’. We have not defined a common time for A and B. The latter 
time can now be defined in establishing by definition that the ‘time’ 
required by light to travel from A to B equals the time it requires to 
travel from B to A. Let a ray of light start at the ‘A time’ t A from A 
towards B, let it at the ‘B time’ t B be reflected at B in the direction of A, 
and arrive again at A at the ‘A’ time ‘t'fi. 

In accordance with definition the two clocks synchronize if 

t B - t A = £ - t B (3.22) 

According to the principle of the constancy of the velocity of 
light, the speed of light is the same in all directions of empty space, 
so that light should take the same time to go from A to R as from 
B to A. According to Einstein’s definition, if the time recorded by 
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the two stationary clocks at A and B for the light to go from A to B 
is the same as the time they record for the light to go from B to A, 
then the clocks must be synchronous. If they were not synchronous, 
they would not measure the times to be the same. This definition 
of synchronization is based on the principle of the constancy of the 
velocity of light. If the velocity of light has the same numerical 
value in all regions of space, and if light travels in straight lines, 
then it is possible to synchronize all the clocks which are at rest in 
any inertial frame. 

Equation (3.22) can be rewritten in the form 

~ + tj) (3.23) 

If an astronaut, in a space ship at rest relative to the inertial 
fr am e in which the earth is at rest, wanted to synchronize his watch 
with say Greenwich Mean Time, he could ask his base station on 
earth to send him a radio signal. On receipt of this signal, the 
astronaut should send a radio message back to earth immediately. 
He should also record the time on his watch when he received the 
radio signal. If the radio signal left earth at t A G.M.T. and was 
received back on earth at t' A G.M.T., then the time the astronaut 
received the radio signal was \{t A + t'J) G.M.T. This information 
can be transmitted to the astronaut, who can then adjust his watch 
to read Greenwich Mean Time. 

Alternatively, if he knew how far away he was from the earth, 
the astronaut could ask his base station to send him a message 
saying what the time was, for example, they could transmit a 
television picture of Big Ben. The astronaut would have to allow 
for the time the radio signal took to reach him. If he were on the 
moon, the signals would take ~ljsec to reach him. If he were 
somewhere in the solar system the signals would take a time of 
the order of minutes to reach him. Even if one were on the earth 
and looked at a distant clock through a telescope or even at one’s 
own wristwatch one should, in principle, allow for the time light 
takes to reach the eye from the clock. In normal everyday life, 
such a time delay is so small that it can generally be ignored, 
though, of course, in principle it is always present. 

Einstein chose to define how to synchronize spatially separated 
clocks using light signals. This led many people to assume that 
the concept of time in the theory of special relativity depends on the 
law of propagation of light. Einstein’s own view can be summarized 
by the following quotation 9 : 

The theory of relativity is often criticized for giving, without justification, 

a central theoretical role to the propagation of light, in that it founds 
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the concept of tune upon the law of propagation of light. The situation 
is however, somewhat as follows. In order to give physical significance 
to the concept of time, processes of some kind are required which en able 
relations to be established between different places. It is immaterial 
what kind of process one chooses for such a definition of time. It is 
advantageous however, for the theory to choose only those processes 
concerning which we know something certain. This holds for the 
propagation of light in vacuo in a higher degree than for any other process 

H A 1 LorenhT COnSldered thanks t0 the investigations of Maxwell and 


Any process other than light signals can be used to synchronize 
spatially separated clocks, such as elastic couplings or electrical 
signals transmitted along connecting wires. If the synchronization 
is carried out in a way consistent with the theory of special relativity 
no contradiction with the light signals method should arise, if the 
theory of special relativity is correct. In practice, it is often simpler 
to use radar methods to determine the positions and times of 
distant events. An account of this method will now be given. 


3.9. RADAR TECHNIQUES FOR MEASURING 
POSITIONS AND TIMES OF EVENTS 

Radar techniques can be used to measure the times and positions 
ot events, for example, to plot the position of a moving rocket. Let 
a directional antenna transmit a short radio pulse at a time t as 
shown in Figure 3.7. This radio signal travels with a speed c in 



Moving 

rocket 


Figure 3.7. A directional antenna is used to transmit a radar pulse which is 
reflected by a moving rocket 
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empty space. When it reaches the rocket, a small fraction of the 
radio waves are reflected in the backward direction, in the direction 
of the transmitter. These signals can be received and amplified. 
Let the reflected signal be received back at the radar base at 
t + At. If the speed of radio signals is the same in both directions, 
as required by the principle of the constancy of the velocity of light, 
then it takes the radio signals the same time to reach the rocket as 
to return from the rocket. Hence the radio waves are reflected 



Figure 3.8. A space-time diagram showing the world line of the 
rocket. The radar pulse is transmitted from the radar base at the 
world point A having co-ordinates x = 0, m — ct. It is reflected 
by the rocket at the world point B atx = c At/2, m = e(< + At/2) 
and received back at the radar base at the world point A' at x = 0, 
m = c(t + At) 


from the rocket at a time t + At/2. In a time interval At/2 the 
radio signals travel a distance c At/2. Hence at the time t + At/2, 
the rocket is at a distance of c At/2 from the radar base. If a 
directional antenna is used, the position of the rocket in space can 
be determined using only one clock, in the radar station on earth. 

It is convenient to plot the successive positions of the rocket 
graphically. It will be assumed that the rocket moves along the x 
axis of an inertial frame S. It will be assumed that the radar 
station is at x = 0. It is conventional in the theory of special 
relativity to plot m = ct against x as shown in Figure 3.8. Such a 
diagram is called a space-time diagram. The displacement of a 
rocket moving with uniform velocity is shown in Figure 3.8. Such 
a curve is called a ‘world line’. Events are called ‘world points’. 
The ‘world lines’ of light or radio signals, travelling with speed c 
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along the x axis, are at 45 degrees to the x and ct axes, as shown in 
Figure 3.8, since in a time A t light signals travel a distance Ax = cAt. 
The event corresponding to the emission of the radar pulse is 
represented by the world point A (x = 0, m — ct). The ‘world line’ 
of the radio signal is shown dotted. It reaches the rocket at the 
‘world point’ B having co-ordinates x = cAt/2; m = c(t -f At/2). 
The reflected radio signal reaches the radar base at the ‘world point’ 
A' having co-ordinates x = 0, m = c(t + At). By repeating the 
experiment, the successive positions of the rocket can be determined 
and its velocity measured. If the rocket is moving with uniform 
velocity its ‘world line’ is straight as shown in Figure 3.8. If the 
rocket is accelerating, its ‘world line’ is curved. 

Some readers may find it simpler to visualize the measurement 
of the times and co-ordinates of distant events, using the radar 
methods described above, rather than use all the paraphernalia of 
rulers and synchronized clocks distributed throughout space, with 
observers distributed throughout space to record events when and 
where they occur. Radar methods can be used to develop the 
Lorentz transformations in the way described in Appendix 6, where 
references to other works will be found. This method is now 
known as the method of the ff-calculus. 

In this Section, a space time diagram has been used to represent 
the displacement of a rocket relative to one inertial frame only. 
The method will be extended in Chapter 6 in such a way that the 
same ‘world lines’ and ‘world points’ can be used to represent the 
displacement of the rocket relative to more than one inertial 
reference frame. 

Example. A rocket is moving directly away from the earth with uniform 
velocity. A radar pulse is transmitted from the earth at 12.00 G.M.T. The 
pulse reflected from the rear of the rocket is received back at the radar base 
at 12.02 G.M.T. The pulse reflected from the front of the rocket is received 
back at the radar base 2 fisec later. A second radar pulse is transmitted at 
12.04 G.M.T. and, after reflection at the rear end of the rocket, is received 
back at base at 12.18 G.M.T. Find the speed and proper length of the 
rocket. 

As practice, the reader should draw an accurate space-time diagram 
showing the ‘world lines’ of the radar signals and the path of the 
rocket as shown in Figure 3.9(a). The reader can then estimate the 
speed of the rocket graphically. 

If it returns to the radar base at 12.02 G.M.T., the radar pulse 
transmitted at 12.00 G.M.T. reaches the rear of the rocket at 12.01 
G.M.T. The distance travelled by the radio signal in the 60 sec 
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between 12.00 and 12.01 G.M.T. is 60c, or 1-8 X 10 10 m. Thus at 
12.01 G.M.T. the rear of the rocket is 1-8 X 10 10 m from the earth. 

Similarly, the radar signal transmitted at 12.04 G.M.T. reaches 
the rear of the rocket at 12.11 G.M.T. when it is 7 X 60c or 
12-6 X 10 10 m from the earth. In the 10 X 60 sec between 12.01 



Figure 3.9. (a) Space-time diagram showing the world lines of two radar signals 
sent to and reflected by the rear of the rocket. The world line of the rocket can be 
determined graphically. Since tan 6 = 3/5, the speed of the rocket is O’6c. 

(b) Enlarged space-time diagram around the world point B in Figure 3.9(a). 

The first radar pulse is reflected from the rear of the rocket at the world point B 
and from the front of the rocket at the world point B', 1 p sec later. Hence 
PB ' = 10“®c = 300 m. The length of the rocket is the distance between the 
world lines of the rear and front of the rocket at a fixed time. For example, at 
12*01 + (1 (i sec) the length of the moving rocket is given by RB'. By drawing 
the diagram to scale , the reader will find that the length of the moving rocket is 120 m. 

By trigonometry, PR = 300 tan 6 — 180 m, so that RB / =120 m. 

and 12.11 G.M.T. the rocket travels a distance of 6 X 60c. Hence 
the speed of the rocket is 0-6c. 

The reader should draw the world lines of the front and rear of 
the rocket and the world lines of the first radar pulse on an enlarged 
scale and determine graphically the length of the moving rocket, as 
shown in Figure 3.9(b). 

The first radar signal is reflected from the rear of the rocket at 
12.01 G.M.T. and from the front of the rocket 1 /isec later. If v 
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is the speed of the rocket, in 1 fi sec the front of the rocket moves a 
distance 10 -6 p, so that after passing the rear of the rocket, the radar 
pulse covers a distance / -(- 10 _ S before reflection at the front end 
of the rocket 1 /usee later, where l is the length of the moving rocket. 
In 1 fisec the radar signal travels a distance 10 -6 c. Hence 

10-«c = l + 10 -6 w 

Since v — 0-6c, we have 

l = 10-« x 3 x 10 8 (1 - 0-6) = 120 m 

This is the length of the moving rocket. From eqn (3.18) for the 
Lorentz contraction, 

/ = l 0 V 1 - v */c 2 

Hence 

l 0 = l/Vl — v 2 /c 2 = 120/Vl - 0-6 2 

or 

l 0 = 150 m 

Hence the proper length of the rocket is 150 m. This example 
illustrates how one could determine the length of a moving body 
using radar techniques. 

3.10. EXAMPLES OF TIME DILATION 
3.10.1. Gedanken Experimente 

Let a lamp, which is stationary in the inertial frame E' , emit a 
pulse of light which travels a distance along the y axis before 
it is reflected by a mirror, as shown in Figure 3.10(b). In E' the 
time interval A t' between the light leaving the x' axis and returning 




Figure 3.10. The measurement of a time interval in terms of the distance light travels 
in the time interval 
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A.' = & 
c 


The motion of the pulse of light relative to E is illustrated in 
Figure 3.10(a). Notice the light leaves at one point on the * axis 
and returns at another. The time interval A t between the light’s 
leaving and returning to the x axis is now given by 

r 

. V>° + hr 


The same value is used for the velocity of light in E and E' so as to 
be in accord with the principle of the constancy of the velocity of 
light. Rearranging eqn (3.24), one has 


In Section 3.7 it was shown directly from the postulates of the 
theory of special relativity that y a should be equal to y' 0 . Hence, 
usin g the distance light travels as a measure of a time interval, and 
assuming c is the same in all inertial frames, 


-ST 


This is the expression for time dilation. 

Notice that if the time interval between the time of emission of 
the light and the time of its return to the x axis were measured by 
clocks at rest in the two inertial frames, then one would need one 
such clock at rest in E' to measure the time interval A t' which would 
be a proper time interval. On the other hand, in E one would 
require two such clocks, one at rest at A to record the time the pulse 
left the x axis, and the other at rest at B to record the time of the 
return of the light pulse to the x axis. If the clocks at rest in E were 
synchronized as described in Section 3.8, then if the theory of 
special relativity is correct, the difference between the readings of 
the one clock in E' should be related by eqn (3.25) to the difference 
between the readings of the two clocks at rest at A and B in E, if 
the same unit of time is used in E and E'. 

The relation (3.25) is perfectly reciprocal, since if the light 
returned at the same point in E, one would have At — 2y 0 [c, 
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whereas the light pulse would return at different points on the x' 
axis in E', and one would then have 


At' = 


2jj 

cV 1 — V 2 /c 2 


(3.26) 


Experiments of the type described above have not been per¬ 
formed on the same system simultaneously to measure the same 
time interval in two different inertial frames, so that the relations 
(3.25) and (3.26) have not yet been confirmed directly by experi¬ 
ment. However, the time variable appears in the interpretation of 
many phenomena and the interpretations of some of these depend 
almost entirely on the Lorentz time transformations. Two experi¬ 
ments of this type, namely the Doppler effect and the decay of 
cosmic ray ^-mesons in the atmosphere, are now being discussed. 
Further examples based on the application of the Mossbauer effect 
are given in Section 8.4. 


3 . 10 . 2 . The Doppler Effect 

The possibility that the motion of a source of light may affect the 
position of a line in a spectrum was pointed out by Doppler in 1842 . 
At present, a simplified treatment of the Doppler effect is given; 
the general case is treated in Section 4 . 4 . The present treatment is 
designed to try and bring out the role of the Lorentz time trans¬ 
formation. Let a source of light, which is at rest at O' the origin of 
E‘, emit light of frequency v’ when measured in S’. The emission of 
each maximum of the wavefront is treated as an event. Let the 
observer be at rest at O the origin of E. If E' moves with uniform 
velocity v relative to E, then, after the origins coincide, the source 
of light is moving with uniform velocity v away from the observer 
at rest m E. If a wave crest is emitted from O’ at a time t' measured 
in 2T, then according to the Lorentz transformations, relative to E, 
this event would be recorded as happening at a position 

x = y(x' + vt') = yvt’ 

at a time 

t — y(t' -f- vx'/c 2 ) = yt’ 

This light then travels with a velocity c in vacuo from O' to 0, and 
reaches O at a time xjc later. Hence, the wave crest reaches O at a 
time 

T = 1 + ~ c = + y t = y*'(i + ■:) 
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Let the next maximum be emitted from O' at a time t' + 
measured in S'. This maximum reaches 0 at a time 


r+ir = ,(.' + i)(i+j) 

Hence 



where v is the frequency of the light reaching 0 measured in S. 



(3.27) 

(3.28) 


Expanding to first order of v/c, one has 

v = v'(l — |a/c)( 1 — Ivjc) = /(l — vjc) (3.29) 


Hence, when the source is receding from the observer the frequency 
of thfe light should go down and the spectral line should be displaced 
towards the red end of the spectrum. If c — Xv in S, and c = AV 
in S', then eqn (3.27) can be rewritten as 


A = 


A'(l + vie) 
V1 — a 2 /c 2 


To first order, 

A = A'(l + o/c) 

When the source is approaching the observer. 


v — V ^' - vjc_ __ v > j c + v v ' p) c ) to first order 

(1 — vjc) ■ 'sc — v 


and 


A'(l — vie ) 

A = 7 = = ^ A'(l — vjc ) to first order 

V 1 — v z jc 2 ‘ 


(3.30) 

(3.31) 


(3.32) 


(3.33) 


Ives and Stilwell 10 measured accurately the apparent frequency of 
the blue-green Hp line of wavelength 4861 A emitted by hydrogen 
atoms moving at high velocities in canal rays. The positive particles 
were produced in a hydrogen arc and were accelerated in an electric 
field. The spectrum of the light emitted in the direction of motion 
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of the canal rays was viewed directly by a spectrograph. Using a 
mirror the light emitted in the direction at 180 degrees to the 
direction of motion of the canal rays was reflected back on to the 
slit of the spectrograph. A typical photograph obtained by Ives and 
Stilwell is shown in Figure 3.11 {a). The photograph consists of a 



Figure 3.11. (a), (b) and (c) Typical spectra obtained by Ives and Stilwell 10 . The spectra 
consist of central undisplaced lines with ‘symmetrically' placed lines on either side; the 
accelerating potential differences were (a) 7859, (A) 13702, and ( c ) 20755 V respectively; 

(d) the second-order shift X m — X’ = “ plotted against the first-order Doppler shift 

A A = [vjc)X’ for various accelerating potentials. It can be seen that the experimental points 
are in very good agreement with the predictions of the theory of special relativity, which are 
shown by the continuous line. (By courtesy of the Editor, Sci. Proc. Roy. Dublin Soc 11 .) 

central line with symmetrically placed lines on either side. There 
were always some stationary atoms present and they gave rise to the 
central line. On either side of the central line are the displaced 
lines; the lines due to canal rays coming towards the spectrograph 
are displaced towards the blue, and those due to the reflected light, 
emitted by the same canal rays in a direction directly away 
from the slit, are displaced towards the red end of the spectrum. 
In Figure 3.11(a) there are two lines on either side of the central line. 
These correspond to molecular aggregates of hydrogen of double 
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and triple mass respectively. To first order of o/c both the classical 

v 

and relativistic theories predict a shift of AA = ± - A', where X is 

c 

the wavelength of the light in the inertial frame in which the 
source of light is at rest. The displacement AA enabled Ives and 
Stilwell to calculate v/c in each case. Ives and Stilwell calculated 
the average wavelength A m of the displaced lines. According to the 
theory of special relativity, from eqns (3.30) and (3.33) one has 


therefore 


K = 2 = £[A'(1 - vie) + X{\ + vie)] 

_ X 
~ Vl - v 2 /c 2 


A m — X 


= X 


1 

-Vl - » 2 /c 2 




(3.34) 

(3.35) 


Typical sets of results obtained by Ives and Stilwell are shown in 
Table 3.1 and in Figure 3.11 (d). It will be seen that for all accelerat¬ 
ing potentials the results are in good agreement with the predictions 


Table 3.1 


Accelerating potential 
difference (V) 

AA = - X (A) 

C 

i-.X 
c 2 

calculated (A) 

observed (A) 

26,735 

25-82 

0-0670 

0-067 

34,395 

29-40 

0-0869 

0-090 

40,190 

31-93 

0-1049 

0-0995 

42,260 

32-50 

0-1098 

0-113 


of the theory of special relativity. For second order terms, the 
predictions of the classical theory depend to some extent on the 
velocity of the ether wind, but whatever value is chosen for 
the ether wind the predictions differ from eqn (3.35) in terms of 
the second order. Thus the relativistic theory of the Doppler effect, 
which is based so intimately on the Lorentz time transformation, 
was confirmed by Ives and Stilwell to terms of second order of v/c. 

3.10.3. The Decay of Cosmic Ray (i-mesons in the Atmosphere 

According to the law of radioactive decay, if a number N 0 of 
radioactive atoms are at rest at the origin of an inertial frame E' at 
t’ = 0, then after a time f' the number remaining is given by 

N = N 0 t~ i ’ /T <> (3.36) 
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where T 0 is the average time a radioactive atom lives before it 
decays when it is at rest. The radioactive atoms are moving along 
the x axis relative to the inertial frame E, so that according to the 
Lorentz transformations corresponding to a point x' = 0 at a time 
t' in E', one has in E, x = yvt' and t = yt'. Substituting in eqn 
(3.36) one obtains 

N = N 0 e~V vT ° (3.37) 

Hence, relative to E, the radioactive atoms should be measured to 
live an average time of yT 0 and not T 0 , so that the average lifetime 
of radioactive atoms should be measured to be longer when the 
radioactive atoms are moving relative- to the laboratory. No 
experiment has so far been performed with radioactive atoms. A 
variety of fundamental particles have been discovered having 
masses between the mass of the electron and the mass of the proton. 
These undergo spontaneous decay and have short lifetimes. The 
first to be discovered was the //-meson, which was first observed in 
the cosmic radiation. The average lifetime of //-mesons when they 
are at rest has been shown experimentally to be T 0 = 2-2 X 10~ 6 sec. 
(Rosser 12 , Section 3.8.2.) 

Now the //-mesons in the cosmic radiation are generally produced 
near the top of the atmosphere as a result of the nuclear interactions 
produced by the primary cosmic radiation. In these interactions 
7r-mesons are produced, which subsequently decay into //-mesons. 
Now, if //-mesons only lived for a time of T 0 = 2-2 X 10~ 6 sec 
relative to the earth, even if they travelled with the velocity of 
light, they would only travel an average distance of cT 0 = 3 X 
10 8 X 2-2 X 10 -6 m — 660 m before decaying. If this were correct, 
virtually none of the //-mesons produced near the top of the atmos¬ 
phere would be expected to reach sea level, whilst in fact a sub¬ 
stantial proportion of the //-mesons do. Hence, the //-mesons 
reaching sea level must either have velocities exceeding the velocity 
of light or must five longer than 2-2 X 10 -8 sec relative to the earth. 
No velocities exceeding the velocity of light have been recorded and 
so the phenomenon has been interpreted as an increase of the 
apparent lifetime of the //-mesons by a factor y — 1/(1 — r 2 /c 2 )*, 
where v is the velocity of the //-mesons relative to the earth. This 
is in agreement with the predictions of the theory of special rela¬ 
tivity. The //-mesons at sea level have momenta ~3 x 10® eV/c 
corresponding to y ~ 30. According to eqn (3.37) the lifetime of a 
//-meson of momentum 3 X 10® eV/e should be ~30 x 2-2 X 10 -6 
sec relative to the earth, and such a //-meson should travel an 
average distance ~20 km relative to the earth before decaying, and 
therefore should have a substantial probability of reaching sea level. 
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Hence, the behaviour of the //-meson in the cosmic radiation can be 
accounted for satisfactorily in terms of the theory of special relativity. 
It is interesting to interpret the co-ordinates and times of the events 
of creation and decay of a //-meson in terms of the Lorentz trans¬ 
formations. Normally, the lifetimes of //-mesons are distributed 
statistically, but for purposes of discussion, it will be assumed that 
a //-meson lives for a time T 0 in the inertial frame in which it 
remains at rest. Ionization loss will be neglected, so that the velocity 
of the //-meson remains constant relative to the earth. Let the 
origins of 2 and S' coincide at t = t' = 0 at the point of creation of a 



Figure 3.12. (a) A pt-meson is created at the origin of S' at the instant t' = 0 
when the origins of S and S' coincide. If the fit-meson moves with uniform 
velocity v relative to S it remains at rest at the origin of S'. ( b) The position 
of the ft-meson when it decays 

//-meson, as shown in Figure 3.12(a). Let S' move with the same 
velocity v as the //-meson so that the //-meson remains at rest at 
O', the origin of S’. In S' the //-meson would be created at x = 0, 
t' = 0 and would decay at x = 0 , t' = T 0 . In S it would be 
created at x = 0, t = 0 and would decay at 

x = y(x' + vt') = yvT 0 
t = y(t' + vx'l c 2 ) = yT 0 

Hence, an observer in S (that is on the earth) would say that the 
//-meson travelled with a velocity v for a time yT 0 covering a 
distance yvT 0 , so that relative to the earth we have time dilation. 
An observer at rest in S' would say that, relative to the //-meson, 
the origin of S and also the earth had travelled with a velocity v 
for a time T 0 covering a distance vT 0 , by the time the //-meson was 
observed to decay in S'. This distance is less, by a factor y, than 
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the distance travelled by the /z-meson relative to the earth. Rela¬ 
tive to the jtt-meson, there is no time dilation, but the earth and the 
earth’s atmosphere are Lorentz contracted. 

3.10.4. The Decay of ir-mesons in the Laboratory 

Beams of charged 7 r-mesons of kinetic energies up to 6 GeV can 
now be produced in the laboratory by accelerators, such as the 
proton synchrotron at G.E.R.N. Charged 7 r-mesons undergo 
spontaneous decay into /^-mesons and neutrinos. The mean lifetime 
of 7 r-mesons, when they are at rest, is 2-55 X 10 _ 8 sec. The rest 
mass of ir-mesons is 139-6 MeV/c 2 . 

Problem —Calculate the average distance (mean free path) 77 -mesons of 
velocity (a) 0-75 c, ( b ) 0-9c ( c ) 0-99c (d) 0-999c (e) 0-9995c (corresponding to 
kinetic energies of approximately (a) 72 MeV (6) 180 MeV (c) 850 MeV 
( 1 d ) 3 GeV (e) 4-4 GeV respectively) will travel before they undergo spon¬ 
taneous decay. (The formula relating kinetic energy and velocity will be 
developed in Chapter 5. 1 GeV = 10® eV and 1 MeV = 10 8 eV. The 
electron volt is defined in Section 5.3.3.) 

From eqn (3.37) the mean lifetime of moving 7 r-mesons relative 
to the laboratory is yT 0 , where T 0 = 2-55 x 10 ~ 8 sec and 
y — (1 — /S 2 ) _i where P = ufc. If u is their velocity, the average 
distance 7r-mesons will travel before undergoing spontaneous decay 
is uyT 0 . If P is 0-75, y = 1-512. Hence, 

yT 0 = 1-512 X 2-55 X 10 ~ 8 sec = 3-87 X 10“ 8 sec 

Mean distance = uyT 0 = 0-75 c X 3-87 X 10 -8 sec = 8-7 m. 

The values for the other cases are 


(b) 

/3 = 0-9 

y = 2-294 

yT 0 = 5-84 X 10 _8 sec 

W 

ft = 0-99 

y = 7-089 

yT 0 = 18-1 X 10“ 8 sec 

(d) 

ft = 0-999 

y = 22-37 

yT 0 = 57 X 10 -8 sec 


P = 0-9995 

y = 31-63 


yT 0 = 81 X 10 -8 sec 


uyT 0 = 15-7 m 


uy T 0 = 54 m 


uyT 0 = 170 m 


uy T 0 — 244 m 


An experiment with 7r-mesons of velocity ~0-75c and kinetic 
energy ~72 MeV was carried out by Durbin, Loar and Havens 13 
in 1952. They found experimentally that the average distance such 
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7r-mesons travelled before they decayed was 8-5 4: 0-6 m. If there 
were no time dilation, and if their velocity were 0-75e, the ir-mesons 
would only travel an average distance of 5-7 m before decay. 

More recently, experiments have been carried out with 7r-mesons 
of kinetic energies up to ~6 GeV. The above problem shows that 
77-mesons of kinetic energy 4-5 GeV should travel an average 
distance of about 250 m, before undergoing spontaneous decay. 
Experiments have shown that the speed of light is a limiting speed 
for particles (cf. Section 5.4.2). Thus, but for time dilation, 
77--mesons could only travel a maximum average distance of 
2-55 X 10~ 8 e or 7-6 m before decaying. In practice, high energy 
nuclear physicists can place their 77-meson detectors, such as 
bubble chambers, at a distance of 100 m or more from the point 
where the 7r-mesons are produced, since, allowing for time dilation 
the mean free path for decay is about 250 m for 4-5 GeV 77 -mesons. 
But for time dilation the intensity of 4-5 GeV 77 -mesons would 
decrease to exp ( — 100/7-6) or ~2 X 10~ 6 of its original intensity 
in 100 m. Since the speed of light has been shown to be the limiting 
speed for particles, the 77 -mesons must live longer than 2-55 x 10~ 8 
sec relative to the laboratory, so that these experiments confirm 
time dilation. Relative to the reference frame in which the 77 -mesons 
are at rest, the dimensions of the laboratory are Lorentz contracted 
in a direction parallel to the direction of relative motion. 

High energy nuclear physicists are now working in an energy 
range where the use of the equations of the theory of special 
relativity is imperative. To quote Kittel, Knight and Ruderman 14 : 

It has been said that almost every high-energy physicist tests special 
relativity every day. He uses the Lorentz transformation with the same 
confidence that physicists in the nineteenth century used Newton’s laws. 

It has been shown that the predictions of the theory of relativity 
agree with the experimental results in the case of the Doppler effect 
and in the case of /t-meson decay. This gives one confidence in the 
use of the time transformation. It must be pointed out that time 
appears as a variable in the interpretation of many other phenom¬ 
ena, for example, velocity is equal to the distance travelled per 
unit time. The successes of the relativistic velocity transformations 
in interpreting experiments are indirect evidence in favour of the 
time transformation. Momentum is defined as the product of mass 
times velocity. The development of relativistic dynamics in Chapter 
5 is based on the Lorentz transformations, and the successes of this 
branch of the theory of special relativity are again indirect evidence 
in favour of the time transformation. 
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It must be emphasized that in this section it was assumed that 27' 
always moved with the same uniform velocity v relative to 27. Under 
these conditions, the two inertial frames were perfectly reciprocal. 
The discussion of what may or may not happen when a clock is 
accelerated, such that it undergoes a journey in a closed path is 
deferred until Chapter 8. Such a journey necessarily involves the 
acceleration of the clock from one inertial reference frame to 
another at some stage during the journey. 

Example. The furthest star in our galaxy is about 10 £ light year? away. 
Explain in terms of time dilation (or length contraction) how it is possible, 
in principle, for a human being to reach this star within his normal life span. 
Estimate what uniform velocity a rocket would require to reach the star in 
10 years, measured in the inertial reference frame in which the rocket is at 
rest. 

Let the earth be at the origin of an inertial frame 27, and let the 
rocket be at the origin of 27'. Let the rocket leave the earth at 
t = F = 0 and travel with velocity v, [y = (1 — » 2 /c 2 ) _i ] for a time 
t' = 10k relative to the rocket, where k is the number of seconds in 
a year. Let the rocket reach the star at a distance of 10 6 light years 
or 10 b kc metres from the earth at a time t relative to 27. From the 
Lorentz transformations, since x' = 0, we have 

t — y(t' + vx'/c 2 ) - yt' = y\0k 

This is the normal expression for time dilation. The time for the 
journey relative to 27' can be measured by one clock on the rocket 
and so is a proper time interval. The time for the journey relative 
to 27 must be measured by two spatially separated synchronized 
clocks at rest relative to the earth. If v is the speed of the rocket the 
distance travelled by the rocket relative to the earth in a time 
t — ylOk sec is y\Qkv. But the distance from the earth to the star is 
10 s light years or 10 5 kc metres. Hence 

10 5 kc = y\0kv 

{Alternatively, one can use the Lorentz transformation 

x = y(x' + vt') 

where x — 10 5 kc, x = 0 and t ' = 10k}. Hence 

10 4 c = v/V 1 — v 2 jc 2 
10 8 (1 - v 2 jc 2 ) = v 2 jc 2 
or 

v 2 jc 2 = 10 8 /(1 + 10 8 ) = 1/(1 + 10- 8 ) 

75 


~PufLC. Tfeat/tcwtofieu/ 



THE LORENTZ TRANSFORMATIONS 


or 

»^(l-5xl0- 9 )c (3.38) 

and 

y^= 10 4 

Thus due to time dilation, the measured time for the journey is less 
relative to the rocket than relative to the earth, so that, in principle, 
it is possible for an astronaut to reach the star in his normal life 
span. However, it will be shown in Section 5.10 that, due to fuel 
considerations, it is impossible to accelerate conventional rockets to 
such high speeds, so that, at present at least, the question is of 
academic interest only. However, the protons in the cosmic 
radiation do have such high speeds. For example, a proton of 
velocity given by eqn (3.38) would have an energy of ~10 13 eV. 
Protons of energies up to ~10 19 eV have been observed in the cosmic 
radiation. Protons of energy 10 19 eV have a value of y ~ 10 10 . 
If they were not deflected by galactic magnetic fields, such protons 
would reach the edge of the galaxy in about 5 min, measured in the 
rest frame of the proton. 

Relative to the rocket, the galaxy moves with a velocity — v, 
where v is given by eqn (3.38), for a time of 10 years covering a total 
distance of 10 kv 10 kc relative to the rocket. Relative to the earth, 
the distance from the earth to the star is 10 5 kc. Hence, relative 
to the moving rocket, the distance from the earth to the star is 
Lorentz contracted by a factor y ~ 10 4 . Relative to a proton of 
energy 10 19 eV the dimensions of the galaxy are Lorentz contracted 
by a factor y ~ 10 10 and, in the direction of relative motion of the 
proton and the galaxy, relative to the proton the galaxy would have 
roughly the dimensions of the solar system. 

3.11. THE PROPAGATION OF SPHERICAL WAVES 
IN EMPTY SPACE 

Consider a spherical array of photomultipliers at rest in empty 
space at a distance r from the origin of the inertial frame 27. If light 
is emitted from the origin of 27 at a time t = 0, then, according to the 
principle of the constancy of the velocity of light, the light should 
reach all the photomultipliers at rest in 27 at the same time t = rjc 
relative to 27. Similarly, if there is a spherical array of photo¬ 
multipliers at rest in 27' at a distance r' from the origin of 27', then, 
according to the principle of the constancy of the velocity of light, 
the light emitted from the origin of 27' at the time t' = 0, when the 
origins of 27 and 27' coincide, should reach all the photomultipliers 
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at rest in S' at a time t' = r'/c measured in S'. According to the 
principle of the constancy of the velocity of light, observers at rest 
in S and S’ respectively should both record that the light was 
travelling outwards as a spherical wave in their respective co¬ 
ordinate systems, and with their respective origins as centres of the 
wavefront. It will now be shown how these two, apparently 
contradictory, viewpoints can be reconciled using the Lorentz 
transformations. 

Consider two photomultipliers at rest at points A and B on the 
x axis of S' at distances +r' and —r' respectively from the origin 



Figure 3.13. The two photomultipliers A and B are at rest in S’ as shown in ( b ). Light 
emitted from the origins of S and S' at t = /' = 0 reaches A and B simultaneously relative 
to S'. The photomultipliers A and B are moving relative to S; B moves towards 0 so that 
light emitted from 0 at t = 0 reaches B before it reaches A, relative to S 


O', as shown in Figure 3.13(b). According to the principle of the 
constancy of the velocity of light, the light emitted from O' at the 
time t' = 0 should reach the photomultipliers at A and B simul¬ 
taneously at a time t' — r'/c in S'. However, an observer at rest in 
S would say that, initially, the photomultiplier at B was moving 
towards the light, whilst the photomultiplier at A was moving away 
from the light as shown in Figure 3.13(a). Relative to S the light 
should be detected by the photomultiplier at B before the light 
reaches A. According to the Lorentz transformations the observer 
in S would record the detection of the light at B at co-ordinates and 
time given by 

x b = y(— r> + vt ’); ( b = ?(*' — vr 'l c2 ) 
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and corresponding to the detection of the light by the photo¬ 
multiplier at A, he should record 

*a = r( r ' + vt> ); 1 a = y(*' + vr 'l c2 ) 

Thus, according to the Lorentz transformations, the observer at 
rest in Z would record the light reaching B before A, but he would 
also record that the distance the light travelled to B was less than to 
A. One has 

x a y\r' + 1 *'] 1 > 7 *' +»] 

U ~ v\? + vr 'l c2 ] ~ [1 + WWini 


Now, if r'/t' = c, then 


Similarly, 


Xjl _ g + p 
t A ~ 1 + v/c 


x bI*B — g 


A similar argument can be applied in the inertial frame £' to the 
detection of the light by the photomultipliers at rest in the inertial 
frame Z. Due to the different measures of both the spatial and 
temporal separation of the events of emission and detection of the 
light, the observers at rest in Z and Z' can both obtain the same 
numerical value for the velocity of light. They can both record the 
propagation of the light as a spherical wave with their respective 
origins as the centres of the wavefronts, though when one of the 
observers detects the wavefront with his ‘stationary’ array of 
photomultipliers the other observer would not record these events 
as simultaneous, nor on the surface of a sphere in his own frame of 
reference. 


3.12. INTERVALS BETWEEN EVENTS 

Let an event occur at the point x, y, z at a time t in the inertial 
frame Z. Let another event occur at the point x + dx, y + dy, 
z + 6z at a time t + St. In the inertial frame Z' let these events 
occur at x', y', z! and x' + dx', y' + by', z' + bz! at times t' and 
t' + St' respectively. According to the Galilean transformations 
one would have 

bx = bx' + v bt' 
by — by' 
bz = bz' 
bt = bt' 
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According to the Galilean transformations the temporal interval 
between the two events should be an invariant. If bt' and bt were 
zero, 

bx 2 + by 2 + <5z 2 = bx' 2 + by' 2 + bz' 2 

which means that the distance between two points measured at the 
same time in E and E' would be an invariant; that is, lengths are 
absolute according to the Galilean transformations. 

According to the Lorentz transformations one has 

8x = y[bx' + v dt'] 

by = by' 

bz = bz 

bt = yibt' + bx'/c 2 ] 

According to the Lorentz transformations the temporal interval 
between two events is not an invariant. Now 

bx 2 + by 2 + <5z 2 = y 2 (bx' 2 + 2v bx'. bt’ + v 2 bt' 2 ) + by’ 2 + bz' 2 

so that, even if bt' were zero, bx 2 + by 2 + <5z 2 would not be equal to 
bx' 2 + by' 2 + bz' 2 . However, 

bx 2 + by 2 + bz 2 - c 2 bt 2 

T v bx'l 2 

= y 2 [bx' + v <5f'] 2 + <5/ 2 + bz' 2 - c 2 y 2 \ bt' + 

= y 2 \bx' 2 + 2» bx' bt' + v 2 bt' 2 — c 2 bt' 2 — 2v bx' bt' 

- V 2 bx' 2 lc 2 ] + by' 2 + bz' 2 

= y 2 [_bx' 2 (l - v 2 /c 2 ) - c 2 bt' 2 (l - v 2 jc 2 )] + by' 2 + bz’ 2 
= bx' 2 + by' 2 + bz' 2 - c 2 bt’ 2 

so that 

bs 2 = bx 2 + by 2 + bz 2 - c 2 bt 2 (3.39) 

is an invariant in all inertial frames moving with uniform velocity 
relative to each other. The quantity bs is called the interval between 
the two events. [Some writers prefer to define bs 2 such that 

bs 2 — c 2 bt 2 — {bx 2 + by 2 + bz 2 ) 

We shall always define bs 2 according to eqn (3.39).] According to 
the theory of special relativity if two events occur, two observers, 
one at rest in E and the other at rest in E', will record different 
distance and time separations between the two events, but they will 
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record the same interval 8s. If 8s 2 is an invariant, then ds 2 cannot 
change sign when one transforms from one inertial frame to the 
other. If<5.y 2 is positive, as defined by eqn (3.39), then the interval 
8s between the two events is called a space-like interval, and, if 
<5 j 2 is negative, 8s is called a time-like interval. The space-like and 
time-like properties of 8s will be illustrated by a few numerical 
examples. 

Let one event occur at x = 0 at / = 0, and let the other event 
occur at x = 6c, t = 10 sec relative to the inertial frame 27. It is 
assumed for the remainder of this section that y and z are always 
zero for all the events. (The expression 6c represents 6 X 3 X 10 8 m 
and is assumed to have the dimensions of a length.) In 27 we 
have 

&r 2 = 8x 2 — c 2 8t 2 = (6c) 2 — c 2 10 2 = —64c 2 


so that 8s is a time-like interval. 

Now consider an inertial frame 27' moving with uniform velocity 
c/2 relative to 27. In 27' one has 



illustrating that <5.r 2 is invariant. 

Now let v = 3c/5, such that y = J. In this case 8x’ = 0, 8t' = 8 
sec; 8s 2 = —64c 2 . Notice that in this case 8x' is zero, so that in this 

inertial frame the two events are recorded at the same place. 

In general, 8x' is zero in the equation 

8x' = y(8x — v 8t ) 

when 

v 8x 

v = 8x[8t or - = (3.40) 

If 8s is time-like, <5.s 2 is negative, c 2 8t 2 > 8x 2 and c |<5f| > |<5x|. 

Thus, if 8s is time-like, it is always possible to satisfy eqn (3.40) with 
v < c and find a reference frame in which 8x' is zero. In this 
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reference frame, the time interval between the two events is a 
proper time interval, since the two events are recorded at the same 
position. Let the proper time interval be denoted by br. Equation 
(3.39) becomes 

ds 2 = bx 2 + by 2 + bz 2 — c 2 St 2 = — c 2 8t 2 

If v is given by eqn (3.40), 

, , / v dx\ , /, bx 2 \ 

8t j =y*(l - c T^i) 

Since ds 2 is negative, that is c 2 bt 2 > bx 2 , bt' always has the same 
sign as bt, so that, when bs is time-like, the time order of the two 
events is the same in all inertial reference frames. 

Consider an example in which the first event is again at x = 0, 
t = 0, but the second event is now at x — 10c, t = 6 sec in the 
inertial frame E. The interval bs 2 is now space-like and is equal 
to +64c 2 . In the inertial frame E' having velocity c/2 relative to 
E, one now has bx' = 14c/V3 and bt' = 2/V3 such that the interval 
is again 64c 2 . In an inertial frame moving with velocity v = §c 
relative to E, one has 

bx' = 8c; bt' = 0; bs 2 = +64c 2 

In this last reference frame, the events would be measured to happen 
at the same time. It is possible to choose such an inertial frame 
whenever bs 2 is positive and bs is a space-like interval. 

In an inertial frame moving with velocity v = fc relative to E, 
we have 

bx' = bt' - -— sec; bs 2 = +64c 2 

o 6 

Thus in the inertial frame moving with velocity fc relative to E, 
the ‘second’ event is recorded as happening before the ‘first’, so 
that the order of events is reversed. 

In the equation 



the time interval bt' is zero if 

v bt 
c C bx 

If bs is space-like, bs 2 is positive, bx 2 > c 2 bt 2 and |&c| > c |<5t|. 
Hence if bs is space-like, it is possible to find an inertial reference 
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frame in which 8t' is zero. If v/c > c dt/dx, but less than unity, 
then 8t' is opposite in sign to St and the time order of the events is 
different in Z and Z', as illustrated in the above numerical example. 

The reversal of the time sequence of events in different inertial 
frames is possible whenever 8s is a space-like interval. This predic¬ 
tion of the theory of special relativity provoked a lot of discussion at 
one time. Before the introduction of the theory of relativity and the 
advent of quantum mechanics, it was believed that physics was 
strictly deterministic. It was believed that a given cause always 
gives rise to the same effect. Under the influence of the prevailing 
ideas of absolute time, it was believed that it was possible to have a 
universal definite time order for all events, and all earlier events 
could in principle affect all later events. It seemed absurd that the 
order of some events could be different in different inertial frames. 
Now for events for which Ss 2 is positive, one cannot send a light 
signal from one event to the other as [<5x| is greater than c |<5f|. This 
can be checked for all the numerical examples given above. It 
will be shown in Chapter 5 that, according to the theory of special 
relativity, energy and momentum cannot be transmitted with a 
velocity exceeding the velocity of light. Hence, when & 2 is positive, 
what happens in one of the events cannot influence what happens 
in the other as light cannot go from one event to the other. There 
can be no direct causal connection between the two events, so that 
it is irrelevant which event is measured to occur first, as neither 
event can influence the other. One cannot influence what is going 
on at present on a star a thousand light years away, since, if one 
shines a torch at the star now, the light will not reach the star until 
1000 years hence. One cannot even influence what is going on on 
the moon at present, because a light signal would take T28 sec to 
reach the moon and a rocket would take much longer. When 8s 2 
is negative, that is 8s is a time-like interval, then it is possible to send 
a light signal from one event to the other since |(5x| < c |<5i[, as 
illustrated in the numerical examples. In these cases causal con¬ 
nection between the two events is possible. For example, one could 
influence what happens on the moon in a year’s time by sending a 
rocket to reach there in time. For events for which 8s 2 is negative 
there is a definite time order for the events, which cannot be 
reversed by transforming to any other inertial frame moving with 
velocity v < c relative to the first. These conclusions are illustrated 
graphically in Figure 3.14. It will again be assumed that the zeros 
of x and t are chosen such that one event takes place at the origin 
of the inertial frame Z at t — 0. The abscissa represents the 
x co-ordinate of the other event and the ordinate represents the 
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product of the velocity of light and the time of the other event. 
The paths of light rays going along the x axis and which are at 
the origin x = 0, t = 0 are lines given by x = ±ct and are at 
45 degrees to the axes as shown in Figure 3.14. Events in the region 
labelled FUTURE can be reached from the origin with velocities 
less than the velocity of light and can be influenced by what happens 
at the origin at t = 0. Events in the region labelled PAST could 


Path of particle of 
velocity less than c 



Figure 3.14. One event takes place at the origin at x = 0, t = 0. If another event takes 
place in the regions marked PAST and PRESENT, then &s 2 , the interval between the two 
events, is negative and causal connection between the two events is possible. If the other event 
takes place in the regions marked ELSEWHERE then 8 i 2 is positive and no causal connection 
is possible between the event and the event at the origin 

have sent signals to reach the origin at or before t — 0, and could 
influence what happens at x = 0, t — 0. Events in the regions 
labelled ELSEWHERE cannot send light signals to reach the origin 
by the time t = 0, neither can light signals be sent from the origin 
to reach these events in time to have any influence on them, so that 
these events can have no causal connection with what happens in 
the event at the origin at x = 0, t = 0. 

The intervals between the event at the origin and events in 
either the PAST or FUTURE regions are time-like (8s 2 < 0), and 
the intervals between the event at the origin and all events in the 
ELSEWHERE regions are space-like (8s 2 > 0). Future and past 
take on a new meaning in the theory of special relativity depending 
on whether or not causal connection is or was possible. With a 
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little imagination on the part of the reader, Figure 3.14 can be 
extended to events, for which the y and z co-ordinates of the events 
are not zero. 

3.13. THE ROLE OF OBSERVERS IN THE THEORY 
OF SPECIAL RELATIVITY 

In this chapter we have often discussed what an observer at rest 
in the inertial frame S or an observer at rest in the inertial frame 
S' would measure. For example in Section 3.6, we considered what 
an observer on rocket 1 and what an observer on rocket 2 would 
measure. The use of the term observers in such contexts led some 
people to conclude that the theory of special relativity required the 
intervention of observers in measurements in a way different from 
classical Newtonian mechanics, and for this reason the theory of 
relativity was assumed by many people to be more ‘idealistic’ and 
less ‘materialistic’ than Newtonian mechanics. In both Newtonian 
mechanics and in the theory of special relativity the measurements 
can be performed by instruments, and the information transmitted 
to a base where it can be fed into a calculating machine, and after- 
constructing and setting up the apparatus the ‘observer’ need only 
read and interpret the results when necessary. The role of the 
observer is very similar in both theories, so that on this point 
one theory cannot be said to be more idealistic or materialistic than 
the other. It is beyond the scope of this work to discuss this aspect 
of the two theories any further. The observers were introduced in the 
previous discussion of Newtonian mechanics in Section 1.4 and 
in the present chapter merely to try and make the exposition easier 
to understand. In future we shall generally merely state that a 
variable is measured to have a certain value in S rather than say 
that an observer in S would measure it to have that particular value, 
unless it helps the exposition to introduce imaginary observers. 

What the theory of special relativity did show was that some 
quantities such as length and time intervals, which had previously 
been assumed a priori to have invariant values in all inertial frames 
of reference, need not have the same numerical values in different 
inertial frames moving relative to each other, but their numerical 
values may depend on the arbitrary standard of rest chosen. It was 
illustrated in Sections 3.6 and 3.7 that these predictions are plausible 
when they are interpreted in terms of measurements that can 
actually be carried out in the laboratory. The theory of special 
relativity emphasized that the quantities used in a theory must be 
defined and measured in terms of that particular theory, and the 
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quantities used in a theory must be related directly or indirectly to 
measurements that can be performed in the laboratory. 

For purposes of discussion it was found convenient to think of the 
co-ordinate system S and S' as consisting of a series of ‘rigid’ rulers 
parallel to the axes, with a series of synchronized clocks distributed 
throughout space so that the co-ordinates and time of an event could 
be measured when and where it happened. It must be stressed 
that this model was a purely symbolic way of illustrating how the 
co-ordinates and time of an event might be measured. One could 
perform the measurements in practice using the radar techniques 
described in Section 3.9 and Appendix 6. 

REFERENCES 

1 Einstein, A. Amin. Phys. 17 (1905) 891 

2 James, J. F. and Sternberg, R. S. Nature, Land. 197 (1963) 1192 

3 Sadeii, D. Phys. Rev. Lett. 10 (1963) 271 

4 Alvager, T., Nilsson, A. and Kjellman, J. Nature, Land. 197 (1963) 
1191 

5 Babcock, G. C. and Bergman, T. G. Opt. Soc. Amer. (1963), Annual 
Meeting Program, p. 31 

6 Alvager, T., Farley, F. J. M., Kjellman, J. and Wallin, I. Phys. Lett. 
12 (1964) 260 

7 Lieber, L. R. and Lieber, H. G. The Einstein Theory of Relativity, p. 26, 
1949. London; Dobson 

8 Einstein, A. Relativity, The Special and the General Theory, p. 25, 1921. 
London; Methuen 

9 Einstein, A. The Meaning of Relativity, 3rd edn, p. 27, 1946. London; 
Methuen 

10 Ives, H. E. and Stilwell, G. R. J. opt. Soc. Am. 28 (1938) 215; 31 
(1941) 369 

11 Ives, H. E., Scient. Proc. Roy. Dublin Soc. 26 (1952) 9 

12 Rosser, W. G. V. An Introduction to the Theory of Relativity. 1964. London; 
Butterworths 

13 Durbin, R. P., Loar, H. H. and Havens, W. N. Phys. Rev. 88 (1952) 179 

14 Kittel, C., Knight, W. D. and Ruderman, M. A. Mechanics, Berkley 
Physics Course Vol. 1, p. 358, 1965. New York and London; McGraw-Hill 


PROBLEMS 

Problem 3.1 —What are the two main postulates of the theory of special 
relativity? What extra postulates are implicit in the theory of special 
relativity? 

Problem 3.2 —Write a critical essay on what is meant by ‘an inertial frame 
of reference’ in various physical theories. Discuss the scope and limits of 
applicability of the concept. 

Problem 3.3 —A rod of proper length L 0 is at rest in an inertial frame S’. 
The rod is inclined at an angle 6' to the x’ axis in S’. What is the length of 
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the rod in S, and what is its inclination to the x axis in S, if S' moves with 
uniform velocity v relative to S along the common x axis ? 

Problem 3.4 —By what amount is the earth shortened along its diameter 
(as measured by an observer at rest relative to the sun) due to the orbital 
motion of the earth around the sun? (Take the velocity of the earth as 
30 km/sec and the radius of the earth as 6367 km.) 

Problem 3.5 —A rocket is moving at such speed in the lab.-system that 
its measured length is half its proper length. How fast is the rocket moving 
relative to the lab.-system? 

Problem 3.6 —Show that the circle x' 2 +y' 2 — a 2 in S' is measured to be 
an ellipse in S, if S' moves with uniform velocity relative to S. 

Problem 3.7 —There are clocks at rest in both S and S'. The clocks at 
rest in S are synchronized with each other, and the clocks at rest in S’ are 
synchronized with each other. The origins of S and S' coincide at t — t’ = 0, 
and S' moves with uniform velocity v along the +x axis relative to S. At a 
later time t, which of the clocks at rest in S' read the same time as all the 
clocks at rest in S? [Hint: put t’ — tin the time transformation to obtain 

A i 

X = — {1 — (1 — v 2 jc 2 )l} t. 

The clocks can have any y and z co-ordinates.] 

Problem 3.8—A rocket of proper length 600 m is moving directly away 
from the earth with uniform velocity. A light (or radar) pulse is sent out 
from the earth and is reflected from mirrors at the back end and the front 
end of the rocket. If the first light (radar) pulse is received back at base 
200 sec after emission and the second pulse is received 17-4 ft- sec later, 
calculate (a) the distance of the rocket from the earth and (b) the velocity of 
the rocket relative to the earth. 

Problem 3.9 —How would you measure the length of a moving rocket? 
[Hint: An adaptation of the method of the previous example may be 
suitable.] 

Problem 3.10 —If the mean lifetime of a ft- meson when it is at rest is 
2-2 X 10~ 6 sec, calculate the average distance it will travel in vacuo before 
decay, if its velocity is (a) 0-9 c; ( b) 0-99 c; (c) 0-999 c. 

Problem 3.11 —-One thousand /x-mesons are produced at an altitude of 
40 km. How many ^-mesons should on the average reach sea level before 
they decay if the velocity of the /x-mesons is 0-999c and if their mean life 
is 2-2 X 10 -6 sec when they are at rest? (Assume the ft- mesons travel 
vertically downwards without energy loss due to ionization.) 

Problem 3.12 —Describe the experiment performed by Durbin, Loar and 
Havens [Phys. Rev. 88 (1952) 179] on the lifetime of moving charged 
77 -mesons. How did the measured lifetime of moving 7r-mesons compare with 
the accepted value for the mean lifetime of 7r-mesons when they are at rest ? 

Problem 3.13 —An astronaut wants to go to a star 5 light years away. The 
rocket accelerates quickly and then moves with uniform velocity. Calculate 
with what velocity the rocket must move relative to the earth, if the 
astronaut is to reach there in one year, as measured by clocks at rest on the 
rocket. 
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Problem 3.14 —Give an account of the Ives and Stilwell experiment on 
the second-order Doppler effect. Explain how their results can be inter¬ 
preted in terms of the Lorentz transformations. 

Problem 3.1 5—Calculate the Doppler shift in wavelength (using the 
relativistic formula) for light of wavelength 6000 A when the source is 
approaching the observer with velocity (a) 0-lc and (b) 0-5 c. 

Problem 3.16— One event occurs at the origin of the inertial frame Z 
at t — 0. A second event occurs at the point x — 5c, y — z = 0 at a time 
t — 4 sec relative to Z. Find the velocities, relative to Z, of the inertial 
reference frames in which (a) the events are simultaneous, ( b ) event 2 
precedes event 1 by 1 sec, and (c) event 1 precedes event 2 by 1 sec. 

Problem 3.17 —One event occurs at the origin of an inertial frame Z at 
the time t = 0. Another event occurs at x = 4c, y = z = 0 at a time t — 5 
sec relative to Z. (a) Determine the velocity (relative to Z) of the inertial 
frame Z' in which the two events are recorded at the same point in space, 
and ( b ) what is the time interval between the events in Z'. 

Problem 3.18 —If two events occur at the same time t = 0 at points 
x = y = z = 0 and x — X,y — z = 0 in an inertial frame Z, show that the 
interval between the two events is space-like. Show that by a suitable 
choice of inertial frame the spatial separation between the events can be 
made to vary between -f X and oo. Show that there is no limit on the time 
separation between the events, and the time order of the events may be 
different in different inertial frames. Discuss how the last result led to a 
refinement of the classical idea of causality, according to which all earlier 
events could influence all later events wherever they occurred. 

Problem 3.19 —The point O' in an inertial frame Z has Cartesian co¬ 
ordinates x = vt, y = z = 0 at time f, where v is a constant. A second 
frame Z' has O' as origin and Cartesian axes parallel to those of Z. Derive 
the Lorentz transformations between the co-ordinates of an event in Z and 
S'. 

In Z, events occur at the origin and at the point {X, 0, 0) simultaneously 
at t — 0. The time interval between the events in Z' is T. Show that the 
spatial distance between the events in Z' is (X 2 + and determine the 

relative velocity v of the two frames in terms of X and T. [Hint: Use 
the fact that the interval between the events is an invariant.] (Exeter 1960.) 
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4.1. THE VELOCITY TRANSFORMATIONS 
The Lorentz transformations for the transformation of the co¬ 
ordinates and time of an event from one inertial frame to another 
were derived in Chapter 3 from the principle of relativity, and the 
principle of the constancy of the velocity of light. In this chapter the 
transformations for the velocity and acceleration of a particle will 
be calculated on the basis of the Lorentz transformations. When a 
particle is moving relative to an inertial frame, its velocity is defined 
as the distance it moves per unit time measured in that inertial 
frame. Let a particle be measured to be at a point x, y, z at a time 
t in an inertial frame E, and let it be measured to be at the point 
x + dx, y -f- dy, z + dz, at a time t + dt. The velocity of the 
particle relative to E is defined as a vector u having components 

<5* dy dz 

Ux = di ; = & ; Uz = Tt 


(4.1) 


The magnitude of the velocity of the particle is equal to 


a = Va| + aj + a| (4.2) 

The two successive measurements of the positions of the particle are 
events, since they involve observations, such as the passing of the 
particle past marks on a scale, or the passage of a high energy 
charged particle through two scintillation counters. These events 
occur independently of any particular co-ordinate system. The 
co-ordinates and time of these events in different inertial frames are 
related by the Lorentz transformations. In an inertial frame E', 
moving with uniform velocity v relative to E along the common x 
axis, corresponding to x, y, z, t one has 

x' = y(x — vt); y =y; z' — z; t' = y[t — vxjc 2 ) (4.3) 


and corresponding to x + dx,y -j- dy, z + dz, t + dt one has 
x' + dx' = y[x + dx — v(t + <5f)]; y' + dy' —y + dy 

v(x + dx)' 


dt 


z! + dz' = z + dz; t' + dt' = + 

Subtracting (4.3) from (4.4), 

dx' = y\dx — v dt]; dy’ = dy; dz’ = dz; dt' = - 
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The velocity of the particle measured in S' has components 


II 

1 

1 

(bxfbt — v ) 

(»* - v ) 

(4.5) 

II 

1 

T.I « 
o» 

* 

ii 

1 

: &IJ: 

1 

(1 — vujc 2 ) 

, <5/ 

byV 1 - v 2 lc 2 (by/bt)V 1 - v 2 /c 2 

! 

< 

1 

% 

Cl 

L2 

(4.6) 

U y - 6( , - 

( 8t ~ ? dx ) 

1 

(1 - vujc 2 ) 


, bz’ 

u z V 1 — v 2 /c 2 


(4.7) 


Ul== W 

(1 - vujc 2 ) 



These transformations relate to the instantaneous values of the 
velocity in S and S' at some specific identifiable point on the path 
of the particle. The co-ordinates and time of this point in S and S' 
are related by the Lorentz transformations. Notice that, even 
though the increments by and by' are equal, u v and u y are not equal, 
the difference between them arising from the different measures of 
time intervals in S and S'. Notice, if u and v are very much smaller 
than c, then 

u z~^-u x — v; u'y-*u y and u' z -*u z 


This is in agreement with the velocity transformations of Newtonian 
mechanics. 

NOW '2 '2 l '2 , '2 

U 2 = uj + U 2 + U 2 

(u x - P )» + (a 2 + «|K 1 - vjf) 


(«» 


(1 - vujc 2 ) 2 

v ) 2 + ( U 2 - M*)(l - V 2 /c 2 ) 


(1 - vujc 2 ) 


■ 2\2 


(4.8) 


The inverse relations are obtained by interchanging primed and 
unprimed quantities and replacing v by — v. Eqns (4.5), (4.6), 
(4.7) and (4.8) become 


««, = 

K +» 

(4.9) 

(1 + vu'Jc 2 ) 

UyV 1 — 

Uy = 

(4.10) 

(1 + vu'Jc 2 ) 
u' t Vl - v 2 lc 2 

= 

(4.11) 

(1 + vu'Jc 2 ) 

„2 _ 

K + «) 2 + (“' 2 - < 2 )(! - v 2 !c 2 ) 

(4.12) 

u — 

(1 + vu'Jc 2 ) 2 
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The symbols u and u' will be used for velocities measured in the 
inertial frames 27 and 27' respectively. The symbol v will only be 
used for the relative velocity of the two inertial frames £ and 27', 
and the symbol y will always represent 1/(1 — & 2 /c 2 )*. 

As an example of the application of the velocity transformations, 
consider a ball which is rolling along the corridor of a train with a 
velocity u' x relative to the train (27'). If the train moves with a 
velocity v along the * axis relative to the embankment (27), then, 
according to the transformations of the theory of special relativity, 
the velocity of the ball relative to the embankment should be given 
by 


K + y) 

(1 + vu'Jc 2 ) 


As a typical example, let «' be 5 m/sec and let v be 30 m/sec, then 


(5 + 30) 

/ 5 X 30 \ 

V + (3 X 10 8 ) 2 / 


34-9999999999999417 m/sec 


According to Newtonian mechanics 

u x = u 'x + v = 5 + 30 = 35 m/sec 


This example illustrates how, in normal circumstances, the devia¬ 
tions from Newtonian mechanics can be neglected. 

Let a radioactive nucleus move with a velocity v = 0-2e along 
the x axis of the laboratory system 27. Let it emit a /9-particle of 
velocity u’ x = 0-95c relative to the inertial frame 27' in which the 
radioactive nucleus is at rest. If the /9-particle is emitted along the 
x' axis of 27', its speed relative to the laboratory is given by 


u' x + v 0-95r + 0-2c 
= 1 + vu'Jc* = 1 + 0-2 X 0-95 


0-966c 


According to the Galilean transformations the speed should have 
been 0-95c + 0-2c = l-15c. Thus, in nuclear physics, the deviations 
from Newtonian mechanics are very important. 

If one puts u' x — w and «(, = «' — 0 in eqns (4.9), (4.10) and (4.11), 
then, in the inertial frame 27, one would have 


w + v 
1 + vwjc 2 ; 


a„ = 0, u z = 0 


(4.13) 


In Section 3.4 an inertial frame 27" was considered, moving with 
uniform velocity w relative to 27' along the x axis, whilst 27' moved 
with uniform velocity v relative to 27 along the x axis. By applying 
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the Lorentz transformations twice, it was shown that the origin 
of E" moves with a velocity (w + »)/(1 + vw/c 2 ) relative to E. This 
is in agreement with eqn (4.13). 

If u' x = c, then in E: 

c + v 

U * = (1 + cc/c 2 ) = " 

Thus the velocity of light in vacuo has the same numerical value in 
E and E', illustrating that the velocity transformations are consistent 
with the principle of the constancy of the velocity of light, as, of 
course, they should be. 

If u = ac and v = /5c, then 

ac + /5c / a + /5 \ 

^ _C \1 +a/5 ) 

Provided a and /5 are both less than unity, then (a + /5)/(l + a/5) is 
less than unity, as direct substitution of numerical values for a and 
/5 will show, and u x will then always be less than c. Velocities 
exceeding the velocity of light cannot be obtained by the com¬ 
position of a number of velocities, which are themselves less than c. 


4.2. THE TRANSFORMATIONS FOR (1 - a' 2 /c 2 )* 

AND (1 - u 2 /c 2 )* 

The quantities (1 — a' 2 /c 2 )* and (1 — « 2 /c 2 )* occur very frequently 
in relativistic formulae. The transformations for these quantities 
will now be derived. From eqn (4.8) 


Thus, 

1 - a' 2 /c 2 = 1 - 
j 2vu 


(«* — z>) 2 + (a 2 — m 2 )(1 — v 2 jc 2 ) 
(1 - vujc 2 ) 2 


(4.8) 


(1 - vujc 2 ) 2 
2 vu„ v 


v 2 u 2 


i-ii'-i'i.rrn? _ r— _ +r® 4. __ 

I .4 .2 ~ .2 .2 -2 ~ .2 ~ -4 


(1 - vujc 2 ) 2 

1 — c 2 /c 2 — u 2 jc 2 + c 2 a 2 /c 4 (1 — c 2 /c 2 )(l — a 2 /c 2 ) 

(1 — vujc 2 ) 2 ~ (1 — vujc 2 ) 2 

Taking the square root, 


Vl - a' 2 /c 2 = 


V(1 — v 2 jc 2 )( 1 — « 2 /c 2 ) 


(1 - vujc 2 ) 


(4.14) 
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In eqn (4.14), u' is the total velocity of a particle measured in 27', 
and u is the total velocity of the particle measured in 27. The left- 
hand side of eqn (4.14) involves only quantities measured in 27', 
and apart from v, the right-hand side contains only quantities 
measured in 27. Eqn (4.14) is the transformation for (1 — m' 2 /c 2 )L 
Similarly, for the inverse relation, 

vt^_i 2TEWEZW (4.15) 

(1 + vujc 2 ) 

Eqns (4.14) and (4.15) will be used extensively throughout the 
text and the reader should be thoroughly familiar with them. 


4.3. THE TRANSFORMATIONS FOR THE 
ACCELERATION OF A PARTICLE 


In the inertial frame 27 the acceleration a of a particle is defined as 
the rate of change of the velocity of the particle. It is a vector having 
components: 

da* du v d “* ,, 

a “ ~ dt ’ ° v ~ dt l a *~ df ( 4 ‘ 16 ^ 

Similarly, the acceleration a' measured in 27' is a vector having 
components 

d m' , d u', , d*4 

d? 


._- . > _ d % . i 

x ~ dt' ’ v ~ dt' ’ z 


(4.17) 


Now 


da' 


dt' 


du' x dt 
dF x d? 


du'Jdt 

dt'/dt 


_ d ( («* — p) \ /d_ 

dt\(l — vujc 2 ) // dt 


[y(t - W f2 )] 


It is being assumed that 27' moves with uniform velocity relative 
to 27, so that v and y are constants. Carrying out the differentiation 
and remembering that a x — dujdt etc., one obtains 

™jc 2 )a x - (a* ~ v){-l)vajc* 


/ i ro 

a * ( vu x \ L 

H 1 -#, 


(1 - vujc 2 ) 2 


£ 


(, vu x \ 

- 7T + IF - ^ 2 ) 
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that is 

, (1 _ 0 2/ c 2)S 

vu^ a * 

, Ally AUy /At' 

ay = IF =z ~dtjdi = 



(4.18) 




7, vu *\ i va * 

\ c 2 + “» c 2 


Hence, 


Similarly, 


[•-?] K 1 -?) 

’ (-?) 
di£ (1 - v 2 /c 2 ) ( 

i-^r 


az== d? = 


+ (1 - vujc*) a *) 
vujc 2 


(- 5 )' 


+ 


(1 - vujc 2 ) 


(4.19) 


(4.20) 


The transformations (4.18), (4.19) and (4.20) relate the in¬ 
stantaneous values of the acceleration of a particle, measured in Z 
and Z' respectively, at some identifiable point on the path of the 
particle; the co-ordinates and time of this point in Z and Z' are 
related by the Lorentz transformations. 

If u and v are very much smaller than c, then a' = a. This is in 
agreement with Newtonian mechanics, according to which theory 
the acceleration of a particle should be the same in all inertial 
frames moving with uniform velocity relative to each other [compare 
eqn (1.13)]. 

If in the inertial frame Z, a particle moves with uniform accelera¬ 
tion a x in the x direction, then u x , the velocity of the particle in the 
x direction, varies with time. The expression on the right-hand side 
of eqn (4.18) includes a term in u x , which varies with time, so that 
a x does not remain constant even if a x remains constant. 

In Newtonian mechanics the mass of a particle was assumed to be 
an invariant independent of the velocity of the particle. It was 
then possible to define the force acting on a particle as the product 
of the mass and the acceleration of the particle. It will be shown in 
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Chapter 5 that in relativistic mechanics the mass of a particle must 
depend on its velocity, and it is not convenient to define the force 
acting on a particle as the product of the mass of the particle and the 
acceleration produced by the force. The discussion of the relativistic 
definition of force is deferred until Chapter 5. 

4.4. RELATIVITY OPTICS 
4.4.1. Fizeau’s Experiment 

In 1851, Fizeau showed that the speed of light in moving water 
was affected by the motion of the water. A more accurate experi¬ 
ment was carried out by Michelson and Morley in 1886, who 



Figure 4.1. In £' the water is at rest , and the velocity of light is the same in all directions 
and is equal to c\n’. (a) In £ the water is moving with velocity v. According to the velocity 
transformations of the theory of special relativity the velocity of light in the direction of 


motion of the water is equal to [cjn') + vyi — in £ 

showed experimentally that, if ri is the refractive index of light in 
stationary water, the speed of light in water moving with velocity v 
was equal to cfri + »(1 — 1 /n' 2 ) when the light travelled in a 
direction parallel to the direction of water flow. (See Rosser 1 
Section 2.1.7.) This result will now be interpreted in terms of the 
velocity transformations of the theory of special relativity. 

Let E be the laboratory system in which the water is moving 
with uniform velocity v in the positive x direction. Let a narrow 
pencil of light be selected by a series of slits such that the pencil 
of light travels in the positive x direction as shown in Figure 4.1(a). 
In the inertial frame 2' moving with uniform velocity v relative 
to 2 the water is at rest as shown in Figure 4.1 (b). Let the refractive 
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index of the light in S' be equal to n . The velocity of light in S' 
is given by 

u x — c/n’i u'y = 0 ; = 0 


Applying eqns (4.9), (4.10) and (4.11) one obtains 


u v = u z = 0 


(«« + v) 

(1 + vu'Jc 2 ) 



Expanding by the binomial theorem and neglecting terms of the 
order of v 2 jc 2 : 



v) 

1 + — 

1 - “ 

l c Jl 

n c) 


« a . = -^7 + »|l-+ higher order terms (4.21) 


This value for the speed of light in moving water is in agreement with 
the results obtained by Fizeau and by Michelson and Morley. This 
shows that the velocity transformations of the theory of special 
relativity give the correct velocity relative to the laboratory. If the 
Galilean velocity transformations were applied, one would have 


= u ' x + v = -, + v 


This value is not in agreement with the experimental results. In 
order to account for eqn (4.21) in terms of the classical ether theories, 
it was necessary to postulate that the moving water dragged some 
of the hypothetical ether inside the water along with it. 

An interesting feature of the present result is that it was not 
necessary to discuss the actual processes taking place in the water 


which give rise to the value of ( cjn') + v 



for the velocity of 


light in moving water. It can be seen that relativistic considerations 
alone demanded the above result. In order to account for the result 
in terms of electromagnetic theory, Lorentz extended Maxwell’s 
equations to the case of moving media and introduced hypotheses 
about the structure of matter. However, if the equations of the 
new theory obey the principle of relativity and remain invariant in 
mathematical form when the co-ordinates and time are changed 
according to the Lorentz transformations (which Lorentz’s equa¬ 
tions did to first order of r/c), then the values for velocities must 
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transform according to the transformations of the theory of special 
relativity and not Newtonian mechanics. Such a state of affairs 
often arises in the theory of special relativity, when the requirement 
that the equations of a theory be consistent with the postulates of 
the theory of special relativity is sufficient to yield some results, 
without having to discuss in detail processes actually taking place. 


4.4.2. The Aberration of the Light from Stars 

The phenomenon of aberration was discovered by the astronomer 
Bradley in 1728. Bradley observed a variation in the apparent 
position of a star at different times of the year. The phenomenon of 
aberration will now be interpreted in terms of the velocity trans¬ 
formations of the theory of special relativity. 

Consider the inertial frame E in which the sun and the star are 
at rest. Choose the directions of the y and z axes such that the star 
lies in the xy plane. Let the earth move with velocity v relative to 
E along the positive x axis as shown in Figure 4.2(a). In the inertial 
frame E', the earth is at rest as shown in Figure 4.2(b) ; thus E’ 
represents the geocentric system. The telescope selects a pencil of 
light which travels with the ray velocity. Let the inclination of the 
light relative to the x axis be equal to a and oq in E and E' re¬ 
spectively as shown in Figures 4.2(a) and 4.2(b). The telescope must 
be pointed at an inclination oq in E' for the light to enter the tele¬ 
scope along the normal to the telescope. In E the light velocity has 
components 

u x = —c cos a; u y = — c sin a; u z = 0 
According to the velocity transformations 


u x — v 

(1 - vujc*) 


—c cos a — v 


1 + cos a 


(4.22) 


u v V 1 — v 2 /c 2 —c sin aVl — v 2 /c 2 


vuf f i vc cos a" 

[/ + 


Dividing eqn (4.23) by eqn (4.22) one obtains 


tan oq 


, u' v c sin «V 1 — v 2 /c 2 1 + (vjc) cos a 

( c cos a + v) 

V i - » 2 /dn 


(1 + - cos <x) 


, sin aV 1 — v 2 jc : 
tan oq =-;—-= tan a 


cos a + vjc 


1 H— sec a 
c 
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r 

-FtThe geocentric system 



(a) (b) 


ytThe geocentric system moves 
with velocity ♦/relative to-F 



M (d) 


Figure 4.2. (a) The earth is moving with velocity +v relative to the sun. (4) The geocentric 
system corresponding to (a); the telescope must be set at an inclination to the x' axis if the 
light from the star is to enter the telescope normally in the geocentric system, (e) Six months 
later, the earth is moving in the opposite direction relative to the sun. (d) The telescope must 
now be set at a different inclination namely a' in the geocentric system if the light is to enter the 
telescope normally in the new geocentric system 

The angle oq is the angle the telescope must be set in the geocentric 
system in order that the ray of light enters the telescope along the 
axis of the telescope, when the earth is moving in the direction shown 
in Figure 4.2(a). Note oq is less than a. Six months later the earth 
is moving in the opposite direction relative to the sun as shown in 
Figure 4.2(c), This change in direction is equivalent to moving the 
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telescope from an inertial frame moving with velocity +» relative 
to Z, to one moving with velocity — v relative to Z. After six months 
the light from the star comes in at a different angle a 2 in the geo¬ 
centric system, where oco is given by 


, sin aV 1 — v 2 jc 2 
tan a 2 =- ; - = tan a 


cos a 


v/c 


[ Vl — p 2/ c 2- 

1-sec a 

c 


(4.25) 


Eqn (4.25) is obtained by replacing & by — v in eqn (4.24). Since 
a 2 is not equal to oq, the inclination of the telescope, which is at 
rest on the earth, must be changed in the course of six months, 
if the star is to remain in the field of view. Now v r+j 30 km/sec 
~3 X 10 4 m/sec, whereas c = 3 X 10 8 m/sec, hence terms of the 
order of v 2 /c 2 can be neglected. Hence eqn (4.24) becomes 

/ sin a /. , v \ _1 

tan a, =- ;—f = tan a I 1 H-1 

1 cos a + v/c \ c cos a/ 

Expanding by the binomial theorem and neglecting terms of the 
order v 2 /c 2 

, v tan a 

tan a, = tan a —- 

1 c cos a 

that is, 

v tan a 

tan a, — tan a = —- (4.26) 

1 c cos a 


Let tan oq = tan (a + Aa) where Aa < a. 

Expanding in a Taylor series 

tan (a + Aa) tan a + Aa ^ (tan a) = tan « + Aa sec 2 a 
Hence 

tan (a + Aa) — tan a = Aa sec 2 a 


Comparing with eqn (4.26), 

v tan a 

Aa sec 2 a =- 

c cos a 


or 

Aa = —(v/c) sin a 

This means that a( is less than a, so that the telescope has to be 
pointed at an angle less than a to the horizontal in the geocentric 
system, when the earth is moving in the direction shown in Figure 
4.2(a). 
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Similarly, six months later 

Aa = +{v/c) sin a 

In this case the inclination of the telescope to the horizontal is 
greater than a, and greater than the inclination six months earlier. 
Due to the motion of the earth around the sun the inclination of the 
telescope has to be changed between the limits given by eqns (4.25) 
and (4.24) if the star is to remain in the centre of the field of view. 
If the star is overhead, the star appears to move in a circle for 
which Aa = v/c = 10 -4 rad = 20-5 seconds of arc. This result is 
in agreement with the experimental value and also with the value 
derived on the basis of the classical ether theories. 

4.4.3. The Transformation of Plane Waves in Vacuo 

The transformation of a plane wave in vacuo will now be con¬ 
sidered using the transformations of the theory of special relativity. 
Without loss of generality they and z axes can be chosen such that 



Figure 4.3. The transformation of a plane wave in vacuo. The direction of the wave normal 

is different in 2 and 2' 

the normal to the wavefront lies in the xy plane. If the normal makes 
an angle 6 with the x axis in 27 as shown in Figure 4.3(a), then the 
monochromatic plane wave can be represented by 

a o /, is a r, / x cos 0 + y sin 0\ 
yt — A cos 2nv(t - rjc) = A cos 2nvl t -1 

where v is the frequency of the light, c is the phase velocity of light 
in vacuo, y> can represent either the electric intensity vector E or 
the magnetizing force H. The phase (t — r/c) can be interpreted 
as follows. Let a wave crest pass the origin of 27 at the time t = 0 
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and let this wave crest be given a label. Let an observer at rest at 
the point P in Z start counting when this labelled wave crest 
reaches him at a time rjc later, and let him continue to count until 
the time t. Since he counts for a time t — rjc, the number of wave 
crests he counts is equal to v(t — r/c). 

Now, let the inertial frame Z' move with uniform velocity v 
relative to Z along the common * axis as shown in Figure 4.3(b), and 
let the origins of Z and Z' coincide at t = t' = 0. Let an observer 
be at rest at a point P' in Z', and let P' coincide with P at the time t 
measured in Z. Let the corresponding time measured in Z' be t'. 
Let the observer at P' start counting the number of wave crests 
reaching him between the time r'/c when the labelled wave crest 
passes him and the time t'. The number of wave crests counted 
in Z' will be equal to v'(t' — r'/c), where v' is the frequency of the 
light measured in Z', and O'P' is equal to r'. It is being assumed that 
the velocity of light in vacuo is the same in Z and Z'. Now the 
number of wave crests between the labelled wave crest and the wave 
crest passing the points P and P' at the instant they coincide is a 
pure number and is an invariant and independent of any particular 
co-ordinate system. Hence the phase must be an invariant, and for 
a plane wave in vacuo one must have 


^ x cos 6 + y sin 6 j 


x' cos 6' + y' sin O' 
c 


) 


(4.27) 


The co-ordinates and time in eqn (4.27) refer to the coincidence of 
P and P', and these co-ordinates and time are related by the 
Lorentz transformations. Substituting for x, y and t on the left- 
hand side of eqn (4.27) one obtains 


l.h.s, 


= v [y( t, + i) ~ 

= v[K(l -^cosfl) 


y(x 4 - vt) cos 6 y . 

---sin 0 

c c 


sin e] 

c) c 


This expression has the same mathematical form as the right-hand 
side of eqn (4.27), confirming that a plane wave in Z transforms 
into a plane wave in £'. Since the two expressions must be identical, 
the coefficients of *',/ and t' must be the same in both expressions. 
Hence, 


yv (cos 6 — vie) = v' cos 6' 

(4.28) 

v sin 0 = v' sin O' 

(4.29) 

yv[ 1 — (vjc) cos 6] = v' 

(4.30) 
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Eqn (4.30) is the expression for the relativistic Doppler effect. 
Dividing eqn (4.29) by eqn (4.28) one obtains 

Vl - r 2 /c 2 sin 6 V l - v 2 /c 2 tan 6 


tan O' — 


cos 0 — vjc 

Dividing (4.29) by (4.30): _ 

V 1 — v 2 jc 2 sin 0 


1 — (vie) sec 0 


sin O' = 


1 — (vjc) cos 0 


(4.32) 


The inverse relations can be obtained by interchanging primed and 
unprimed quantities and by replacing v by — v. Eqn (4.31) gives 
the inclination 0’ of the wave normal relative to the x' axis in 
Z' at the point P' at the time t'. It can be seen that O' and 6 
are not equal, so that the inclination of the wave normal is different 
in Z and Z'. 

Equation (4.31) can be applied to calculate the aberration of a 
ray of light from a distant star. In order to apply eqn (4.31) to the 
case shown in Figures 4.2(a) and 4.2(b), the direction of the ray of 
light shown in Figure 4.3 must be reversed so that the light comes 
towards the origin, that is 6 in eqn (4.31) must be replaced by 
+ (77 + a) and O' by +(tt -f- aj)- The equation 


Vl — v 2 jc 2 sin 0 
tan 6 = ' 


becomes 


or 


tan (n + aj) = 


cos 0 — vjc 

V 1 — v 2 /c 2 sin (rr + a) 
cos (77 -j- a) — vjc 


(4.31) 


, Vl — v 2 jc 2 sin a 

tan a, =-;-;- 

1 cos a + vjc 


(4.33) 


This is in agreement with eqn (4.24). Similarly if v changes sign, 
eqn (4.31) becomes the same as eqn (4.25). 


4.4.4. The Doppler Effect 

In Section 3.10.2 the Doppler effect was calculated for the simple 
case when the relative velocity between the source and the observer 
was along the line joining them. The general case will now be 
considered. Consider a point source of light which is at rest at the 
origin of Z'. In Z the source is moving with uniform velocity v 
along the x axis. Consider the light emitted at the instant when the 
origins of Z and Z' coincide at t = t' = 0. Let this light reach the 
observer, who is at rest at the point P in Z, at a time t, and let this 
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event be at a point P' in Z' at a time t', as shown in Figures 4.4(a) 
and 4.4(b). The spherical wave in Z can be represented by 

A 

ip = — cos 2irv(t — rjc) 

Let the spherical wave in Z' be represented by 

A' 

ip' = p- cos 2nv'(t' — r'/c) 

The frequency of the light measured in the inertial frame Z', in 
which the source is at rest is denoted by v' and the frequency of the 


The observer is at rest 
in 2 at the point P 




The light emitted from 
0 at 1=0 reaches P 
at a time -f later. In 
this time interval the 
source moves in 2 


- v = velocity of 
the source 

0 is the retarded position 
of the source, that is the 
position of the source 
when the light is emitted 

(a) 



f Moves with velocity^ 
«■ relative to 2 ) 


,The light emitted from 
O'at AO reaches P' 
at a time-j-’ later 
when P coincides with P 


The source is 
at rest in 2' 


(b) 


Figure 4.4. The Doppler effect. The light emitted by the moving source when it is at the 
origin of 2 at t =0 reaches the observer, who is at rest at the point P in 2, at a time t = r/c. 
In 2' the source is at rest 


light measured in the inertial frame Z in which the source is 
moving when the light is emitted is denoted by v. The phase is 
again an invariant. Proceeding precisely as in Section 4.4.3 one 
obtains 


. V1 — v 2 /c 2 sin 0 

tan 0 — —t ^ -n— 

(cos 6 — vjc) 


(4.34) 


, Vl — a 2 /c 2 sin 6 
sin d i — (a/c) cos 6 


(4.35) 


or 


v — yv( 1 — (vjc) cos 0) 

v'V 1 — v 2 jc 2 
V — 1 — (vjc) cos d 
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Equation (4.36) gives the relativistic Doppler effect in the general 
case when the velocity of the source is not necessarily along the line 
joining the source and the observer. Now v cos 0 is the component 
of the velocity of the source in the direction of the observer at the 
instant when the light was emitted by the source, that is, when it 
was at the origin. By the time the light reaches the observer the 
source will have moved, and in this time interval the source may 
undergo changes in its motion. It is generally assumed that once 
the light quanta leave the source, their frequencies are unaffected 
by any subsequent changes in the motion of the source. The position 
of the source at the instant when the light is emitted is called the 
retarded position of the source. 

The transformations for the wavelength can be obtained from eqn 
(4.36) using X'v' — Xv — c. 

When the source is approaching the observer in Figure 4.4(a), 
0=0, cos 0 = +l, and eqn (4.36) reduces to 


v'V 1 — ti 2 /c 2 , lc v 

(1 — v/c ) — V V c — v 


(4.37) 


The frequency of the light goes up when the source is approaching 
the observer. Eqn (4.37) is in agreement with eqn (3.32), which was 
derived in Section 3.10 directly from the Lorentz transformations. 

Similarly, if the source is moving directly away from the observer 
when the light is emitted, then 0 = it, cos it = — 1, and eqn (4.36) 
becomes 


v = 


v'Vl - a 2 /c 2 _ , f, 

1 4 -v/c V N i 


c v 


(4.38) 


In this case the frequency goes down. Eqn (4.38) is in agreement 
with eqn (3.28). It was described in Section 3.10 how Ives and 
Stilwell 2 confirmed the relativistic formulae for the Doppler effect 
when 0=0 and 0 = tt. 

If, in the inertial frame Z, the light reaches the observer in the 

direction which according to the observer at P in Figure 4.4(a) is 

at 90 degrees to the direction of relative motion in the retarded 

position of the source, then putting cos 0 = 0 in eqn (4.36), one 

obtains , 

v — yv 

or - - 

V = v' V1 — v 2 /c 2 

According to the theory of special relativity, if a beam of atoms 
which is emitting light is observed in a direction which according 
to the observer is at right angles to the direction of relative motion, 
then the frequency of the light should differ from the frequency the 
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light would have if the source were at rest relative to the observer. 
This is the transverse Doppler effect. According to the classical 
ether theories there should be no change in frequency in this case. 
On relativity theory the difference arises from the different measures 
of time in 27 and 27'. The experimental confirmation of the trans¬ 
verse Doppler effect would be further support in favour of the theory 
of special relativity. 

4.4.5. The Visual Appearance of Rapidly Moving Objects 

Until McCrea 3 , Terrell 4 , Penrose 5 , and Weisskopf 6 drew attention 
to it, it was a popular misconception that, if one looked at an object 
which is moving with a speed comparable to the speed of light, 
then one would see the object contracted in the direction of motion. 
It must be remembered that what one sees (or photographs) at a 
particular instant depends on the light actually reaching the eye 
(or the camera) at that particular instant. These light quanta will 
have left different parts of the object at different times. Those 
quanta which come from the parts furthest away from the observer 
leave before the quanta coming from the parts of the object nearest 
to the observer. Consider a cube of side l, which is moving with a 
uniform velocity v, which is comparable to, but less than, the velocity 
of light, relative to an inertial frame 27 as shown in Figure 4.5(a). 



Figure 4.5. ( a) The cube is moving with uniform velocity v relative to E. The light 
quanta emitted from the corners A, B, C and D, when the x co-ordinate of the cube is 
zero , reach O simultaneously. Light quanta emitted from the corners E and F when they 
are at the positions E' andF' also reach 0 at the same time as the quanta from A, B, C 
and D. The views of the cube , when it is far away from the observer , according to the 
classical theories and the theory of special relativity are shown in (b) and (c) respectively 
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Let the cube be viewed in a direction perpendicular to the direction 
of motion and from a large distance, so that the angle subtended. 
by the cube at the position of the observer situated at 0, the origin 
of E, is very small as shown in Figure 4.5(a). We shall start by apply¬ 
ing non-relativistic considerations. If the cube were stationary one 
would only see the face ABCD. When the cube is moving the light 
quanta from the four corners A, B, C and D which reach the eye at 
the same time to form a picture must have left the four corners at 
the same time. If the corners are equidistant from the point of 
observation when the light is emitted, as shown in Figure 4.5(a), 
then the points A, B, C and D appear to be at the corners of a square. 
When the cube is moving relative to the observer, light quanta from 
the corners E and F can also reach the eye as shown in Figure 4.5(a). 
The quanta from these corners leave the cube at an earlier time, 
when the corners E and F are at the positions E' and F' as shown in 
Figure 4.5(a). The side ADFE of the moving cube is therefore visible 
and the side ADFE appears as a rectangle. If the eye is far away 
from the cube, to a first approximation, the light from E goes an 
extra distance l in the time that E' goes to E. Hence EE’ is equal 
to (v/c)l. The non-relativistic picture of the cube is illustrated in 
Figure 4.5(b). This view of the cube is similar to the picture of a 
stationary cube which has been turned through an angle, such that 
the face ADFE is visible as shown in Figure 4.6(a). Let the cube be 
turned through an angle « such that the projection of the side AE 



Figure 4.6. The visual appearance of a stationary cube rotated 
through an angle a = sin -1 v/c; the top view is shown in (a) 
and the front view in ( b). The latter is the same as the 
visual appearance of a distant moving cube , predicted by the 
theory of special relativity and iltustratedin Figure 4.5(c) 
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is equal to ( v/c)l as shown in Figures 4.6(a) and (b). The angle a 
is given by 

sin a = vjc 

The projection of the side AB is then equal to 
l cos a = / Vl — v 2 lc* 

Thus, according to non-relativistic theory, the visual appearance of 
the moving cube is not quite the same as the visual appearance of 
the rotated stationary cube since, when the projection of the side 
AE is equal to (v/c)l, the projection of the side AB of the stationary 
cube is equal to Z(1 — w 2 /c 2 )*, whereas the side AB of the moving 
cube appears to have a length L Thus according to non-relativistic 
theory the moving cube should have the same visual appearance as 
a stationary rotated cube whose length AB is increased by a factor 
1/(1 - » 2 / c 2 )*. 

According to the theory of special relativity, the lengths AE, AD 
DF and EF, which are all perpendicular to the direction of motion 
are unchanged, so that the appearance of ADFE would be the same 
as for the non-relativistic case. On the other hand, the length AB, 
which is parallel to the direction of motion, should be reduced to 
Z(1 — zj 2 /c 2 ) i . This length contraction exactly compensates the 
apparent increase in the length of AB in the classical case as shown 
in Figure 4.5(c). Thus, according to the theory of special relativity, 
the visual appearance of the moving cube is the same as that of a 
stationary cube rotated through an angle a = sin -1 (v/c) [compare 
Figures 4.5(c) and 4.6(b)]. 

It was shown by Terrell 4 that, in the general case, when the 
moving object is viewed at any angle then, according to the theory 
of special relativity, provided the angle subtended by the moving 
object at the point of observation is very small, the visual appearance 
of a moving object is undistorted in shape but rotated by the angle 
of aberration, where the angle of aberration can be calculated from 
eqn (4.35). If the object subtends a finite angle at the point of 
observation, then the different parts of the object appear rotated 
by different amounts and this leads to a distortion of shape. It was 
shown by Penrose 5 that the surface of a moving sphere would appear 
distorted if the sphere subtended a finite angle, but the outline of 
the moving sphere would still be circular. 

One must be careful not to confuse measuring with seeing. One 
would measure the length of the moving cube by observing the 
positions of both ends of the cube on a ruler at the same time. 
According to the theory of special relativity one should then measure 
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the length to be contracted. If one saw an undistorted but rotated 
picture of a moving cube, as predicted by the theory of special 
relativity, then, if one calculated the dimensions of the cube allowing 
for the finite time of flight of the light quanta from the various 
parts of the cube, one would deduce that the length contraction had 
taken place. 

The light from a moving object undergoes both aberration and 
the Doppler shift in frequency. It will be assumed that the moving 
object emits monochromatic light in the reference frame S' in 
which it is at rest. It is reasonable to assume that this light is emitted 
isotropically in this reference frame. Relative to S the object is 
moving with velocity v. The angular distribution of the light in S 
can be calculated from eqn (4.35). Since 0 is always less than O' 
except when O' = 7 r, the effect of aberration is to concentrate more 
and more of the light in the direction of motion of the object. Thus 
the intensity of light emitted is greater in the forward than in the 
backward direction, and at extremely high velocities the light would 
be emitted mainly in the forward direction. When the source of 
light is approaching the observer, the frequency of the light is 
increased and for very high velocities would be shifted into the 
ultra-violet. After the source passes the observer the frequency 
drops and is shifted towards the red end of the spectrum, since the 
source is then moving away from the observer. Thus, when a 
monochromatic source of light is moving relative to the laboratory, 
the intensity and frequency of the light emitted in different directions 
varies. Thus, in addition to its apparent rotation in space, the 
colour and apparent brightness of a moving object are different 
when the object moves relative to the observer. 
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PROBLEMS 

Problem 4.1 — A train is passing through a station at a speed of 30 m/sec. 
A marble is rolled along the floor of one of the compartments with a 
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velocity of 15 m/sec relative to the train. Calculate the speed of the marble 
relative to an observer standing on the platform, (a) if the marble rolls in 
the direction of motion of the train, ( b ) if the marble rolls perpendicular 
to the direction of motion of the train as measured by a passenger on the 
train. 

Problem 4.2 —A radioactive nucleus is moving with a velocity c/10 relative 
to the laboratory when it emits a /3-particle with a velocity 0-8c relative to 
the co-ordinate system in which the decaying nucleus is at rest. What is the 
velocity and direction of the ^-particle relative to the laboratory, if, relative 
to the radioactive nucleus, it is emitted (a) in the direction of motion of 
the nucleus relative to the laboratory, and ( b ) perpendicular to the direc¬ 
tion of motion? 

If, in the laboratory system the /S-particle moves in a direction perpendic¬ 
ular to the direction of motion of the radioactive nucleus (c) what is its 
velocity in the lab.-system and ( d ) in what direction is it emitted relative to 
the direction of motion of the decaying nucleus relative to the laboratory, in 
the co-ordinate system in which the decaying nucleus is at rest ? 

Problem 4.3 —A radioactive nucleus which is at rest in Z', but is moving 
with velocity c/4 along the x axis of Z, emits a /S-particle of velocity 08c 
(relative to Z') at an angle of 45 degrees to the x' axis of Z'. What is the 
velocity of the /1-particle relative to an observer going at a velocity of c/4 
along the negative x axis of Z? 

Problem 4.4 —-An observer moving along the x axis of Z with velocity v 
observes a body of proper volume V 0 moving with velocity u along the x axis 
of Z. Show that the observer measures the volume to be equal to 

/g - *)(<? - g) 

V (c 2 - w) 2 

[Hint: The volume is F 0 V1 — u' 1 /^. Use eqn (4.14) to transform 
Vl - u'^c 2 .] 

Problem 4.5- —A radioactive nucleus is moving with velocity v relative to 
the laboratory ( Z) when it emits a /3-particle with velocity u' relative to the 
co-ordinate system Z' in which the radioactive nucleus is at rest. The angle 
measured between lT and the x' axis of Z' is a' (measured in Z') . Calculate 
a, the velocity of the /5-particle relative to the laboratory, and calculate a, 
the angle between u and v in the lab.-system. 

Show that if the radioactive nucleus is moving with velocity u' relative 
to the laboratory when it emits a /3-particle with velocity v, then the speed 
of the /?-particle relative to the laboratory is the same as previously but a 
is not. 

Problem 4.6 —A physicist tells his girl friend, who weighs 20 stone (or 
280 lb.) that the best way for her to slim is to move so fast relative to the 
laboratory that she is Lorentz contracted, (a) What speed would she have 
to move to ‘reduce’ her measured dimensions in the direction of motion to 
half her laboratory size? ( b) What would be her mass relative to the 
laboratory? (Refer to Section 5.2.) (c) What would she really look like 
relative to the laboratory ? 
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Problem 4.7 -—An observer at rest at the origin of an inertial frame 2 
sees two particles travelling with equal speeds 0-8f. One goes along the 
positive x axis, the other along the negative x axis, (a) What is the relative 
velocity of the particles in 2 ? How can this exceed the velocity of light? 
( b) With what speed are the particles transferring energy and momentum 
in 2? (c) What is the velocity of one particle relative to the other? 

Problem 4.8 —In a cathode ray tube the speed at which the electron beam 
sweeps across the fluorescent screen can exceed the velocity of light in empty 
space. How can you reconcile this with the limiting character of the 
velocity of light in empty space? [Hint: Read Section 5.3, and compare 
with Figure 5.2. What is the speed of individual electrons?] 

Problem 4.9 —The Cartesian axes of the space-co-ordinates in the 27'-system 
are parallel to those of the 2 system and the 2' origin is at ( vt , 0, 0) at a 
time t in the T-system. A particle describes the parabolic trajectory 
x — ut, y — \at 2 in the T-system. Find its trajectory in the JC'-system and 
show that its acceleration in 2' is along O'y' and of magnitude 

a{ 1 - v 2 lc 2 )l( 1 - uv[c : 2 ) 2 

[Hint: Use the Lorentz transformations to transform x, y and t and show 
_ (u-v) a(l - v*l<2) 

(1 -vujc 2 ) ,y $ (1 -vul<?)* 

Check on the acceleration in 2', using the transformations for acceleration.] 
Problem 4.10 — A particle is at rest at the origin of an inertial frame 2 
at a time t = 0. It moves such that it has a constant acceleration g relative 
to the inertial frame in which it is instantaneously at rest. Find the velocity 
and position of the particle relative to 2 after a time t measured in 2. 
Discussion —From eqn (4.18) 

. (1 ~ *l*)*q. = df U -I 

^ * (i-«7 **) 3 dt[vi - 

Then proceed as in Section 5.4.1, except that now g = qElm 0 . Show that 
corresponding to eqn (5.36), one now has 


u 



u 

g(l - u 2 jc 2 ) 1 


Corresponding to eqn (5.37), 



Show that eqn (5.38) can be written as (x + c 2 /g) 2 — c 2 t 2 = c i /g 2 , or 
gx 2 + 2c 2 x - gc 2 t 2 = 0. 

This is the mathematical equation for a hyperbola. This type of motion is 
consequently known as hyperbolic motion. 

The motion of a charge q starting from rest in an electric field is an 
example of the above kind of motion. If the electric field is E x , trans¬ 
forming to the reference frame 2' in which the charge is instantaneously at 
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rest, one has E' x = E x (cf. Section 7.3.3) and the force acting on it is qE' x . 
Since the particle is instantaneously at rest in 2', one can write 


, qE'x q e x 
a * m o rn a 

so that its acceleration in the inertial frame in which it is instantaneously 
at rest is a constant.] 

Problem 4.11 —Prove that light signals sent out from the origin of 2 after 
a time / = cfg in 2 will never catch the particle discussed in the previous 
problem. [Hint: Show that the line x = c(t — cjg) is an asymptote to 
the hyperbola ( x + c 2 [g) 2 — A 2 = <r 4 /g 2 .] 

Problem 4.12 —Consider three inertial frames 2, 2’ and 2". Let 2' move 
with uniform velocity v along the x axis of 2, and let 2" move with uniform 
velocity v' along they' axis of 2’. Let the origins of 2, 2' and 2" coincide 
at t = t’ = t" = 0. Show that the line passing through the origins of 
2 and 2 ’ makes angles 0 and 0" with the x and x" axes of 2 and 2", where 


and 


tan 0 = 


v'(l - ^/c 2 )* 


tan 8" = 


v 

v’ 


vV\ — y'*/r 2 


If v and v' are both c, and v' v, show that 0' - 0 - vv’jlc 2 . 

[Hint: Apply the velocity transformations.] 

Comment —This is an example of the Thomas precession. Consider an 
electron moving in a Bohr orbit around a nucleus. According to the theory 
of the Thomas precession, the axes in which the electron is instantaneously 
at rest appear to precess relative to the nucleus with angular frequency to 
equal to — v x a/2c® where v is the velocity and a the acceleration of the 
electron relative to the nucleus. Derive the above equation and discuss 
the role it plays in the interpretation of optical spectra. Reference: Eisberg 
R. M. Fundamentals of Modem Physics p. 340. 

Problem 4.12 —Show that in a dispersive medium, Fresnel’s dragging 
coefficient is equal to 



A dn) 

ndlj 


where n is the refractive index appropriate to A, the wavelength of light in 
the laboratory system. 

For the yellow sodium D lines, the refractive index of carbon disulphide 
is 1-6295 and dn/dA = —1-82 x 10 6 /m. Calculate a precise value for 
Fresnel’s dragging coefficient. You may take Xj) — 5-89 X 10~” m. 
(Rosser 1 , Section 4.4.3.) 

Problem 4.14 —What is the Doppler shift in wavelength of the H a (6563 A) 
line from a star which is moving away from the earth with a velocity of 
300 km/sec. 
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Problem 4.15 —A light source is moving past an observer stationary in 
the laboratory system with a velocity of 0-6c. The distance of closest 
approach is 10 m from the observer. Find the distance (measured in the 
laboratory system) of the source from the observer when the source emits 
light which shows no Doppler shift when it reaches the observer, that is, 
when the frequency of the light reaching the observer, measured in the 
laboratory system, is equal to the frequency of the light measured in the rest 
frame of the source. 

Problem 4.16 —Assume that light is emitted isotropically in the co-ordinate 
system in which a sodium lamp is at rest. If the lamp moves with velocity 
c/4 relative to the laboratory, calculate the half angle of the cone containing 
half the emitted photons. [Hint: Consider light emitted at 90 degrees 
relative to the direction of motion of the lamp in the reference system in 
which the lamp is at rest. Calculate its direction in the lab.-system.] 
Problem 4.17 —A pencil of light rays is emitted into a solid angle dO' 
inclined at an angle 8' to the x' axis in 2'. Show that in 2, which moves in 
the negative x' direction with uniform velocity v relative to 2', the pencil of 
light rays is emitted into a solid angle df2 given by 

= { TTj~e^ d£y ’ where p = vlc 


Show that the solid angle dfl is inclined at an angle 8 to the x axis in 2, 
where 6 is given by 


cos 


cos 8’ + 1 3 
1 + j? cos 8' 


[Comment —Cos 8 can be obtained by dividing eqn (4.35) by eqn (4.34) 
and taking the inverse. We have 


dQ d(cos 0) 
d£V d(cos 6') 


If a source at rest in 2' emits light quanta isotropically in 2', that is the 
number of quanta per unit solid angle in 2' is independent of 8' in 2', the 


number/unit solid angle in 2 is proportional to 



There are more 


quanta in the forward direction in 2 and less in the direction opposite to 
the direction of motion of the source. There is also a Doppler shift in 
frequency.] 

Problem 4.18 —When excited B7 Fe nuclei decay, they sometimes emit 
photons of energy 14-4 keV corresponding to a frequency of 3-46 x 10 18 
c/sec. If the 57 Fe nucleus moves with velocity c/5 when it decays via the 
14-4 keV level: ( a ) calculate the frequencies of radiation emitted at angles 
of0, 60, 90, 120 and 180 degrees (in the laboratory system) relative to the 
direction of motion of the nucleus; ( b) by thinking of the radiation in 
terms of photons calculate the relative numbers of quanta per steradian at 
each angle in the lab.-system. [Hint: Use eqn (4.36) for part (a). For 
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part ( b) assume the radiation is isotropic in the rest system of the'decaying 
nucleus and use the inverse of the formula derived in Problem 4.17.'] 

Problem 4.19 —A plane mirror is moving in the direction of its normal 
with uniform velocity v in the positive x direction in the inertial frame 2. 
Let a ray of light of wavelength strike the moving mirror at an angle of 
incidence a. lt and let the ray be reflected at an angle a 2 to the normal with 
wavelength A 2 . Show that 

sin a, sin a 2 _ A x _ c + v cos oq 

cos otj + vjc cos — vjc ’ A 2 c — v cos a 2 

[Hint: The angle of reflection is equal to the angle of incidence in the 
reference frame in which the mirror is at rest. Rosser 1 , Section 4.4.8.] 
Problem 4.20 —Light of wavelength 6,000 A is incident normally on a 
mirror which is receding with a velocity 3 x 10 7 m/sec in a direction 
away from the incident light, (a) Calculate the change in wavelength on 
reflection. 

If the light is incident at an angle of 45 degrees to the normal of the 
mirror, calculate: (b) the change in wavelength; (c) the angle of reflection. 

Problem 4.21 —A 100 cm long ‘rigid’ rod moves longitudinally over a flat 
table. In its path is a hole 100 cm wide. Suppose the Lorentz factor of the 
rod relative to the table is 10. To an observer moving with the rod, the 
hole appears only 10 cm wide and the rod, being ‘rigid’, should slide over 
the hole. To an observer at rest at the table, however, the rod appears only 
10 cm long, and in passing over the 100 cm hole it is bound to fall somewhat 
under gravity, and will consequently be stopped. Which view is correct? 
(This example is taken from Rindler W., Special Relativity. For a discussion 
of this type of problem refer to Rindler 7 and Shaw 8 .) 
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RELATIVISTIC MECHANICS 


5.1. INTRODUCTION 

The laws of Newtonian mechanics do not obey the principle of 
relativity when the co-ordinates and time are changed according 
to the Lorentz transformations, and must be modified if mechanics 
is to be incorporated into the theory of special relativity. An 
approach due to Lewis and Tolman 1 is followed, and an imaginary 
experiment (a Gedanken Experiments ) is considered in which it is 
assumed that two spheres collide. It is shown how Newtonian 
mechanics must be modified in this special case, in order to make 
the theory of mechanics consistent with the transformations of the 
theory of special relativity. The results will then be extended to 
more general cases. Instead of starting directly from Newton’s laws, 
we shall start from the law of conservation of linear momentum. 
Alternatively one can now approach relativistic mechanics from 
experiments on high-speed particles, without introducing the 
Lorentz transformations. An outline of this approach is given in 
Appendix 4. 

5.2. THE MASS OF A MOVING PARTICLE 

It will be postulated that the law of conservation of linear mo¬ 
mentum holds in relativistic mechanics. It will also be assumed 
that the mass of a body is not absolute, but may depend on the 
speed of the body. 

Consider the collision of two small, identical, perfectly elastic 
spheres. It will be assumed that the spheres are small enough for 
the collision to be considered as taking place at approximately one 
point of space at one instant of time. Let sphere 1 have a velocity 
iT, before the collision having components u' x , u' v , u' z — 0 relative 
to the inertial frame Z' shown in Figure 5.1(b). Let sphere 2 have 
the same speed u' relative to Z', before the collision, but let it move 
in the opposite direction to sphere 1, such that the components of 
the velocity of sphere 2 relative to 27' are — u' x , —u’ y , u z = 0 as 
shown in Figure 5.1(b). Let the spheres be thrown in such a way 
that, when they collide, the line joining their centres is perpendicular 
to the x’ axis of Z'. If the collision is perfectly elastic, the total 
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kinetic energy should be conserved in the collision. From the 
symmetry of the collision, it follows that the total speeds and the x' 
components of the velocities of the spheres should be unchanged by 
the collision, but the y’ components of the velocities of the spheres 
relative to E' should be reversed. Linear momentum is conserved 
relative to 27' in such a collision. 

The collision of the spheres will now be considered relative to the 
inertial frame 27, which is moving with velocity — v relative to 27' 




Figure 5.1. The collision of two small perfectly elastic spheres is observed from two inertial 
reference frames 2 and 2'. In 2', the spheres approach each other with equal speeds before 
collision. It is assumed that linear momentum is conserved relative to both 2 and 2' 


along the x’ axis of 27'. If v is numerically equal to u x , then, before 
the collision, sphere 2 moves parallel to the negative y axis of 27 as 
shown in Figure 5.1(a). If the new laws of mechanics are to obey 
the principle of relativity, when the co-ordinates and time are 
changed according to the Lorentz transformations, then all velocities 
must be transformed from 27' to 27 using the relativistic velocity 
transformations, namely eqns (4.9), (4.10) and (4.11). 


SPHERE 1 

Before collision: The x' and y' components of the velocity of 
sphere 1 in 27' are +u' x and +«' respectively, as shown in Figure 
5.1(b). Using eqns (4.9) and (4.10), we have for the velocity of 
sphere 1 in 27 before collision 

(«; + «) , , u' y V\ - v*/c* 

(1 + vu’Jc 2 ) ’ “ (1 + vu'Jc 2 ) 
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After collision: The x and y' components of the velocity of sphere 
1 in 27' are -\-u' x and — u' y respectively, as shown in Figure 5.1(b). 
Using eqns (4.9) and (4.10), we have for the velocity of sphere 1 in 
27 after collision 


(«i4 = 


(4 + y) . 

(1 + vujc 2 ) 5 


-«;v i - v 2 ic 2 
{Ul)v (1 + vu'Jc*) - 


— («l)v 


(5.2) 


It can be seen that the x component of the velocity of sphere 1, 
relative to 27, is unchanged by the collision, whereas thej component 
of its velocity is reversed as shown in Figure 5.1(a). The total speed 
of sphere 1, relative to 27, is unchanged by the collision. 


SPHERE 2 


Before collision: The x and y' components of the velocity of 
sphere 2 in 27' are — u' x and — u( respectively as shown in Figure 
5.1(b). Using eqns (4.9) and (4.10), and remembering that u' x = v, 
we have for the velocity of sphere 2 in 27 before collision. 


(«*)» 


(-4 + ») n , , -<4i - y*lc 2 

(1 - vujc*) - U> - (1 - Vujc*) 


(5.3) 


After collision: The x' and y' components of the velocity of sphere 
2 in 27' are —u' x and +«' respectively, as shown in Figure 5.1(b). 
Using eqns (4.9) and (4.10), since u' x = v, we have for the velocity 
of sphere 2 in 27 after collision 


(«*)* = 0; 


(«a)v = 


u v V 1 - o*/c 2 
(1 - vu’Jc 2 ) 


-Mu 


(5.4) 


Relative to 27, the x component of the velocity of sphere 2 is zero 
before and after the collision. The y component of its velocity is 
reversed by the collision as shown in Figure 5.1(a). The total speed 
of sphere 2 relative to 27 is unchanged by the collision. 

Let m x and m 2 be the masses of the spheres, before the collision, 
relative to 27. If the mass of a moving body depends only on its 
speed, since the total speeds of the spheres are unaffected by the 
collision, the masses of the spheres relative to 27 should be m x and m t 
respectively both before and after the collision. 

It will now be assumed that linear momentum is conserved 
relative to 27 as well as 27'. Equating the total linear momentum 
in the -\-y direction relative to 27 before collision to the total 
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momentum in the -f -y direction relative to 27 after collision, we have 


m i( u i)v + m s:Mv = m 1 (u 1 ) y + m 2 {u 2 ) v 

Substituting from eqns (5.1), (5.2), (5.3) and (5.4), since (u^y — 
— («i)„ and (u 2 ) y = — {u 2 ) y> we have 

m i(“x)i, + m 2 {u^ y = -~ m i( u i)v — m 2 (u 2 ) y 
2m 1 (u l ) v = —2 m 2 (u 2 ) y 

m^u’yV 1 — v 2 /c 2 m 2 u y V 1 — v 2 jc 2 
(1 + vu'Jc 2 ) “ (1 - vu'Jc 2 ) 

Hence, 

mjl — vu'Jc 2 ) = m 2 (l + vu'Jc 2 ) (5.5) 


From eqn (4.15), for a particle having a velocity u in 27 and a 
velocity u', having an x' component u’ x in 27', we have 


1 + 



(1 - v 2 lc 2 ){\ - u' 2 /c 2 ) 
(1 - u 2 \c 2 ) 


(4.15) 


Applying eqn (4.15) to sphere 1, since before collision the velocity 
of sphere 1 in 27is u l5 and its speed in 27' is u having an x' component 
+u' x , we have _ 


m x 

+ ~c 2 = 


(1 - v 2 /c 2 )( 1 - u' 2 /c 2 ) 
(1 - u\lc 2 ) 


(5.6 a) 


Similarly, since before collision the velocity of sphere 2 in 27 is u 2 , 
and its speed in 27' is u' having an x' component —uj from eqn 
(4.15) we have 


2 _ vjx = 


(1 - v 2 /c 2 ){ 1 - u' 2 /c 2 ) 


(1 - ujc 2 ) 

Substituting from eqns (5.6a) and (5.6 b) into eqn (5.5), 


(5.6i>) 


m 1 V1 — u\jc 2 = m 2 V 1 — u\jc 2 (5.7) 

If the y component of linear momentum is to be conserved in 27, the 
above relation must be satisfied. Eqn (5.7) must be true whatever 
the magnitude of u , so that it must hold for various values of % 
and u 2 . If m l — m g when tq = 0 and m 2 — m 0 when u 2 = 0, eqn 
(5.7) can always be satisfied, whatever the values of tq and u 2 , 
provided 

m 0 

m , = — , =r 

V1 — u\jc 2 


and m, 
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since eqn (5.7) then reduces to m 0 = m 0 . The x component of the 
total linear momentum relative to £ is also conserved in the collision. 
Hence, linear momentum can be conserved in both £ and £', if 
the mass of a moving particle is redefined, such that, when it is 
moving with a velocity u relative to £, its mass m is equal to 


m = 


m 0 

Vl — a 2 /c a 


(5.8) 


where m 0 is the mass of the particle measured in the inertial frame 
in which it is at rest. Relative to £' the mass of the particle must 
be redefined as m 0 ( 1 — u' 2 /c 2 )where u' is the velocity of the particle 
relative to £'. The quantity m Q is called either the rest mass or the 
proper mass of the particle. The quantity m in eqn (5.8) is called the 
relativistic mass. It can be seen from eqn (5.8) that the mass of a 
particle is a scalar quantity. The theory developed above refers to the 
inertial mass of the particle. It must be emphasized that u is the veloc¬ 
ity of the particle relative to £, which can be taken to be the labor¬ 
atory system. The velocity u has nothing whatsoever to do with 
changing co-ordinate systems. From the requirement that the quant¬ 
ities appearing in the laws of mechanics must transform according to 
the transformations of the theory of special relativity, it was neces¬ 
sary to redefine the mass of a moving particle in the laboratory 
system. It will now be assumed that the mass of a moving particle 
must be defined by eqn (5.8) in the general case. This assumes 
that the mass of a particle is independent of the acceleration of 
the particle relative to the laboratory. This redefinition of mass 
leads one in turn to a new set of laws to replace Newton’s laws 
of motion. These new laws will be applied in the laboratory 
system, and it is shown in Section 5.4 that the predictions of the 
new theory are in excellent agreement with experiments carried 
out in the laboratory. 

In relativistic mechanics the momentum of a particle moving 
with velocity a is defined by the relation 


p ~ “* ” Vi-„«/,« (5 - 9) 

where p is the momentum, m 0 the rest mass and m is the relativistic 
mass of the particle. The momentum has components 
m 0 u x m n u„ m n u , 


P :c = 


Vi 


Pv 


(5.10) 


« 2 /c 2 “ Vl-K 2 /c 2 V1 — w 2 /r 2 

Notice that it is the total velocity u that appears in the denominator 
of the components of momentum. 
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Before discussing the experimental verification of the variation of 
mass with velocity, the dynamics of a single particle is now 
developed, using eqns (5.8) and (5.9). 

5.3. THE RELATIVISTIC DYNAMICS OF A SINGLE 
PARTICLE 

5.3.1. The Definition of Force 

In Newtonian mechanics the force acting on a particle can be 
defined as the rate of change of the momentum of the particle, 
under the influence of the force. Since, in Newtonian mechanics, 
the inertial mass of a particle is assumed to be invariant, one could 
write 

dp d du 

f= dF = dh (mu) = m di = ma (5 - n) 

Hence, in Newtonian mechanics, the force acting on a particle could 
also be defined as the product of the mass of the particle and the 
acceleration produced by the force. If the inertial mass of a particle 
varies with velocity, as given by eqn (5.8), then 
d , du dm 

t -5 <m “>=”‘dr + ”dF (5.12; 

In addition to the term equal to mass times acceleration, there is 
also a term equal to u ^ in eqn (5.12). In the theory of special 

relativity, the definition of force as the rate of change of momentum 
is in general not equivalent to defining force as the product of mass 
and acceleration. In the theory of special relativity, it is chosen to 
define force as the rate of change of momentum. The suitability of 
this choice depends, of course, on whether or not the predictions of 
the resulting theory are in agreement with experiments. The 
additional term u(dm/df) in eqn (5.12) arises from the variation of 
mass with velocity. If the acceleration of the particle gives rise 
to a change in the velocity of the particle, then there is a correspond¬ 
ing change in the mass of the particle. Let this change in mass be 
denoted by dm. Associated with this change of mass there must be 
a change in the momentum of the particle in the direction of motion 
equal to dmu. To produce this change of momentum in a time 
dt there must be a force equal to dmu/dt. 

5.3.2. Work and Kinetic Energy 

As in Newtonian mechanics the work done on a particle will be 
defined as the force f acting on the particle multiplied by the 
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distance through which the particle is displaced in the direction in 
which the force is acting, that is the work done in a displacement 
dl will be defined as the scalar product of f and dl, that is 


dH 7 = f. dl (5.13) 

If it is assumed that all this work goes into increasing the kinetic 
energy of the particle, then, 


AT = f. dl (5.14) 

where the symbol T is used to denote the kinetic energy of the 
particle. The rate of increase of kinetic energy is given by 


dT dl 
dt ~ dt 


=- f. u 


(5.15) 


where u is the velocity of the particle. Now by definition 


J 

f = — (m u), where m = — . .. 0 = ■ 

dt V Vl - u 2 /c 2 


Substituting for f in eqn (5.15), 

dT _ d 
dt dt 


dT d 

= T, ( mu ) u 


dn 


dm 


Now 


m dT u + dT u * u 


“* + “» + «* = U 2 


(5.16) 


Differentiating with respect to time, 



da_ Au„ da. 

d u 


^df + ^df + ^df 

= 2u dt 

i.e. 

du da 



dt = “dt 


Also 

u . u = a 2 


and 




dm 

da dm 

da d | 

( m o \ 

da m 0 a/c 2 

dt 

dt da 

dt da* 

Wl - a 2 /c 2 / 
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Substituting in eqn (5.16), 
dT m„ 


da da a 3 /c 2 

a — + m 0 — 


d t Vl - a 2 /c 2 d t dt (1 - a 2 /c 2 )* 

m ° u (17 (i ~ “ 2 ^ 2 + “ 2 / c2 ) 


(1 - a 2 /c 2 )® 
m 0 a da d 




Integrating, 


(1 - a 2 /c 2 )* d* df 
T _ m o g2 

Vl — a 2 /c 2 


.V1 — a 2 /c 2 . 


+ Cl 


where C x is a constant of integration. If it is assumed that the 
kinetic energy of the particle is zero when the velocity of the 
particle is zero, then C x must be equal to —m 0 c 2 . Hence 


or 

and 


V 1 — a 8 /c 2 


— mnC* = m n c 




° Wl - U 2 /c 2 


- 1 


T = me 2 — m 0 c 2 
me 2 = T + m 0 c 2 


(5.17) 

(5.18) 

(5.19) 


Expanding the right-hand side of eqn (5.17) by the binomial 
theorem, 


1 a 2 3 a 4 

T — m 0 c J1 + + 


1,3a 4 
= g m o u + s m ° c 2 ^- 

If a c then 

T ^ im 0 u 2 

This is in agreement with the formula for the kinetic energy of the 
particle given by Newtonian mechanics. It can be seen from eqn 
(5.17) that, if a tends to c, then the kinetic energy T tends to 
infinity. Hence, according to eqn (5.17), an infinite amount of 
work would have to be done to accelerate a particle up to the 
velocity of light. This illustrates how, in the theory of special 
relativity, the velocity of light plays the role of a limiting velocity 
for a particle. It must be stressed that the theory of special relativity 
does not say that one cannot have velocities exceeding the velocity 
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of light in vacuo, but simply says that energy and momentum cannot 
be transmitted with a velocity exceeding the velocity of light in 
vacuo. As an example of a velocity exceeding the velocity of light, 
consider two rulers which cross each other obliquely and lie one 
above the other as shown in Figure 5.2. Let the angle between the 
rulers be 0. Let one of the rulers be stationary and parallel to the 
x axis and let the other ruler move with velocity u in the -\-y direction 
as shown in Figure 5.2. The point of intersection of the rulers moves 
in the negative * direction with a velocity equal to u cot 0. If 



Figure 5.2. One ruler is stationary , whereas the other 
ruler moves with uniform velocity u in the +y direc¬ 
tion. The point of intersection of the rulers moves with 
a speed u cot 0, which may exceed the velocity of light 
even though u is always less than c 

u = 0-8c and 0=15 degrees, then u cot 0 is approximately equal to 
3 c. Thus the point of intersection of the rulers can move with a 
velocity exceeding the velocity of light. The momentum and 
energy of the moving ruler travel in the -f y direction with a velocity 
u, which is always less than c. Another example of a velocity greater 
than the velocity of light is the phase velocity of x-rays in crystals, 
for which the refractive index is less than unity. The energy of the 
waves travels with the group velocity, which is always less than or 
equal to c. In a medium of refractive index n > 1, the velocity of 
light is c\n. Particle velocities exceeding this are possible and have 
been observed. It is the velocity of light in vacuo which is the limiting 
velocity for transmitting energy and momentum. 

Eqn (5.18) can be written as 

T = me* — m 0 c 2 = (m — m 0 )c 2 (5.20) 

showing that associated with a change in the kinetic energy of a 
particle there is a change in its inertial mass. 
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It is convenient to introduce a quantity E defined by the relation 

E=T+ m.c 2 = me 2 = (5-21) 

Thus, E is equal to the sum of the kinetic energy of the particle and 
the quantity m 0 c 2 , which is generally called the rest mass energy of 
a free particle. In Section 5.8 the interconvertibility of rest mass 
energy and other forms of energy is illustrated. The quantity E 
is called the total energy of a free particle or sometimes, simply the 
energy of the particle. The total energy E is a scalar. The symbol 
E is also used for the electric intensity vector. No confusion should 
arise as it will always be obvious from the text to which quantity 
the symbol E refers. Some writers prefer to use the symbol W for 
the total energy of a particle. 

Now, from eqn (5.9), 

m 0 ix 

r Vl - u 2 /c 2 

Hence 

2 2 2 

i rrfiA - W °“ -l m 2 c i 

P c + m oT — (i _ u 2j c ^ A- "'o 6 

m%u 2 c 2 + «q £ 4 — m§«V 
= (1 - « 2 /c 2 ) 


(5.22) 

(5.23) 

Eqn (5.23) is an important relation which is used frequently in high 
energy nuclear physics to calculate the total energy of a particle, 
when the momentum is given and vice versa. 

Now 

T = E — m 0 c 2 

Hence, 

T = cV m%c 2 + p 2 — m 0 c 2 (5.24) 

Eqn (5.24) relates the kinetic energy and the momentum of a 
particle; this relation is also used extensively in high energy nuclear 
physics. 
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Differentiating eqn (5.23) with respect to p. 


But 

Hence 


d E pc pc 2 pc 2 

d P Vm 2 c 2 + p 2 ~ cVm 2 c 2 + p 2 E 


E = me 2 and p = mu 


dE_ 
dp ~ 


(5.25) 


Newton’s laws are generally sufficiently accurate, except when the 
velocities of the particles are comparable to the velocity of light. 
The only available particles which have such very high velocities 
are high energy atomic particles, such as protons, electrons, mesons 
etc. The main application of the relativistic dynamics of a single 
particle is to the motion of such particles in electric and magnetic 
fields. It is assumed in relativistic electromagnetic theory that when 
it is not radiating, the force acting on a charge q moving with a 
velocity u in an electric field of strength E and a magnetic field of 
strength B is given by the Lorentz force: 


f = qE + qu X B (5.26) 

where u x B is the vector product of u and B. The value of the 
charge q is assumed to be independent of the velocity of the particle. 
This is the principle of constancy of electric charge. The values of 
E and B are those calculated from the charge and current distri¬ 
butions using Maxwell’s equations. The application of eqn (5.26) 
to the dynamics of a single particle is reviewed in Section 5.4. 


5.3.3. Units 

The equations developed in Section 5.3.2 can be applied using 
any consistent set of units, such as the metre-kilogram-second 
(M.K.S.) system of units. M.K.S. units are not always convenient 
in atomic physics. For example, the rest mass of the electron is 
9-11 X 10~ 31 kg. It is generally more convenient in atomic and 
nuclear physics to work with a system of units based on the electron 
volt as the unit of energy. 

High energy particles are generally obtained by accelerating the 
particles in an accelerator. It is shown in Section 5.4.1 that if a 
particle of charge q is accelerated through a potential difference 
V 0 volts, then the gain in the kinetic energy of the particle is equal 
to qV 0 volts. It is convenient to measure the kinetic energies of 
moving charged particles in terms of the potential difference through 
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which a particle of electronic charge would have to be accelerated 
in order to have the same kinetic energy as the moving particles. 
If a charge of 1 C is moved through a potential difference of 1 V, 
the work done is equal to 1 J. The work done in moving a charge 
of 1-602 X 10 -19 G through a potential difference of 1 V is equal to 
1-602 X 10- 19 J. Hence 

1 eV = 1-602 X 10- 19 J = 1-602 X 10" 12 ergs (5.27) 

For /9-rays and y-rays it is often convenient to measure energies 
in thousands of electron volts or in keV. 

1 keV = 10 3 eV = 1-602 X 10“ 16 J = 1-602 X 10- 9 ergs 

For higher energy particles, it is convenient to measure energies 
in millions of electron volts or in MeV. 


1 MeV = 10 6 eV = 1-602 X 10~ 13 J = 1-602 X 10~ 6 ergs (5.28) 


For particles of still higher energy, energies are measured in BeV 
or GeV, where 


or 


1 Be VI 
1 GeVj 


= 10® eV = 1-602 X 10- 10 J = 1-602 X 10~ 3 ergs 


Generally, the energy quoted refers to the kinetic energy of the 
particle, but at other times, particularly in /3-decay work, the quoted 
energy often includes the rest mass energy of the particle. 

Now in the equation 

E 2 = c 2 p 2 + my (5.22) 


both cp and m 0 c 2 have the dimensions of energy and can be expressed 
in eV or MeV. For example, m 0 c 2 has the value 0-51098 MeV for 
an electron. Some texts simply quote the masses of particles in 
MeV, implying that this stands for the quantity m 0 c 2 . 

As an example of the appropriate units for momentum, the 
momentum of an electron of kinetic energy 2 MeV will be cal¬ 
culated. Since m 0 c 2 = 0-511 MeV, E the total energy of the 
electron is 2-511 MeV. Using eqn (5.22) and working in MeV 
(2-511) 2 = c 2 p 2 + (0-511) 2 

cp = (6-305 - 0-261)* = 2-46 MeV 
Momenta are generally written in the form 

p = 2-46 MeV/c 

and it is said that momenta are measured in units of eV/e or MeV/c. 

As a further example of the use of eqn (5.22) the rest mass of a 
charged particle which has a momentum of 300 M eV/c when its 
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kinetic energy is 190 MeV will be calculated. From eqn (5.22) 

E 2 = (T + m 0 c 2 ) 2 — c 2 p 2 + mlc 1 

Rearranging, 

m a c 2 = {c 2 p 2 - T 2 )/2T = (300 2 - 190 3 )/2 x 190 

= 142 MeV 
or 

m 0 = 142 MeV/r 2 

This is about 277 times the rest mass of an electron, and the particle 
is probably a 7r-meson. 

In nuclear physics the basic unit of mass was, until recently, the 
atomic mass unit or a.m.u. This was defined as of the rest 
mass of the common isotope of oxygen. One a.m.u. was equal to 
1-6598 X 10~ 27 kg. According to the relation E = tn 0 c 2 , this rest 
mass energy was equivalent to 

E = m 0 c 2 = 1-66 x 10- 27 x (3 x 10 8 ) 2 = 1-491 x 10- 10 J 

Since 1 MeV is equivalent to 1-602 X 10“ 13 J, 

. 1-491 x lO" 10 

1 a - m ' U ' " r-602 x I0-» - 931 M ' v /^ 

Hence 

1 a.m.u. b (1/16) le O = 1-6598 X 10- 27 kg = 1-491 x 10~ 10 J/ C 2 
= 931 MeV/c 2 ( 5 . 29 ) 

The energy equivalent to the rest mass of the electron and the proton 
are 0-511 MeV and 937 MeV respectively. The Eleventh General 
Congress on Weights and Measures suggested that a new unit of 
atomic mass should be adopted, namely a unit which is equal to 
12 the rest mass of the 12 C isotope. One new a.m.u. = i mass 
12 ^ ~ T6603 x 10~ 27 kg. The new scale will gradually come into 
common usage. 

5.3.4. Trigonometrical Methods 

Mathematical calculations can often be simplified by using the 
trigonometrical substitution 

sin 0 = u/c (5.30) 

where u is the velocity of the particle. We then have 

m 0 

(5 - 31 > 

125 


Tftiit/tewtaticixZ P/tulteA 



RELATIVISTIC MECHANICS 


The increase of mass with velocity is obtained by plotting m/m 0 
against u/c, that is, sec 0 against sin 6 as shown in Figure 5.3(a). If 
the factor 1/(1 — m 2 /c 2 )* = sec 6 is given, then u/c = sin -1 6 can 
be determined from trigonometrical tables, and vice versa. 

The total energy of a particle is given by 

E = me 1 = mp 1 sec 6 (5.32) 

The kinetic energy is given by 

T = E — mp 2 = m^(scc 0—1) (5.33) 

Now, 

c 2 p 2 = E 1 — mjjc 4 = m^c 4 (sec 2 0 — 1) = mjc 4 tan 2 0 
or 

cp = rriff 2 tan 0 (5.34) 


A 




Figure 5.3. (a) sec 0 ij plotted against sin 6. This is the same as 
plotting the ratio of the relativistic mass m of a particle to its rest mass 
m 0 against the ratio of the velocity of the particle to the velocity of 
light, (b) The relations between E, T, p and m Q are shown graphically 


The relation E 2 = p 2 c 2 + m 2 c 4 can also be represented graphically. 
The quantities E, pc, and m „c 2 form the sides of a right-angled 
triangle as shown in Figure 5.3(b). We have 


pc 


muc 


sin 0 = — = = u/c 


mc‘ 


It is left as an exercise for the reader to show that eqns (5.32), (5.33) 
and (5.34) can be developed from Figure 5.3(b) using trigonometrical 
methods. 

The kinetic energy calculated from the equation T — %m a u 2 
deviates from the relativistic value by 1 per cent for an electron of 
3-2 keV, a /i-meson of 0-66 MeV and a proton of 6T MeV. It can 
be seen that relativistic effects become important for electrons when 
they are accelerated through a potential difference of a few kilo¬ 
volts. If the same value of velocity is used in both cases, the kinetic 
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energy calculated using the non-relativistic formula is only 50 
per cent of the value predicted by the relativistic formula for kinetic 
energy when electrons, p-mesons and protons have relativistic 
kinetic energies of 0-31 MeV, 64 MeV and 590 MeV respectively. 

5.4. SOME APPLICATIONS OF THE RELATIVISTIC 
DYNAMICS OF A SINGLE PARTICLE 

5.4.1. Motion in an Electric Field (No Magnetic Field) 

It will be assumed that there is a uniform electric field of strength 
E V/m in the positive x direction between the plates of a large 
parallel plate capacitor. If the capacitor is charged to a potential 
difference of V 0 volts then the electric field is given by 

E = E X = V 0 ld (5.35) 

where d is the separation of the plates measured in metres. If it 
is assumed that the value of the total electric charge on a particle 
is independent of the velocity of the charge, then the electric force 
on a charge -\-q coulombs in the electric field E is given by 

f = <?E newtons 

whatever the velocity of the charge. In general one has 

t-U)-* 

d£ W1 — u 2 lcV 
For motion in the x direction, 

d / u x \ _ £ 

M\/l - uHcV 9 m 0 

Integrating, 

u x qE „ 

VT=H? = k t + C ' 

where C x is a constant of integration. If it is assumed that u x = 0 
at t = 0, then C t = 0. Rearranging one has 



that is 

a _£E*K_ 

“* [1 + q 2 E 2 t 2 /m 2 c 2 ] 

or 

dx qEt c 

Ux== dt = m„[ 1 + q 2 E 2 t 2 lm 2 c 2 ]t = {m 2 p 2 lq 2 EH 2 + 1}* ^' 36) 
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As t —> oo, the speed of the charged particle approaches the speed 
of light. An experimental check of this result is described in 
Section 5.4.2. Eqn (5.36) gives the velocity of the charge at any 
time t. Integrating eqn (5.36) 


, = f- 

J m o 


qEt d£ 


[1 + q 2 E 2 t 2 lmy]l 


m 0 c 2 

= . 


q 2 E 2 t 2 ' 

1 


+ C 2 


where C 2 is a constant of integration. If x = 0 at t = 0, then 


n m o°- , r r m o c 

° = ~^E + ° 2 ° r C *~~~qE 


Hence, 




q 2 E 2 t 2 V> 
+ m%c 2 / 


(5.37) 


This expression gives the distance the charge moves in a time t. 
Eqn (5.37) can be rewritten as 


”V 2 \ 2 _ cH z = ^ 


qE) 


q 2 E 2 


(5.38) 


This is the equation of a hyperbola; for this reason this type of 
motion is sometimes called hyperbolic motion, the graph of x 
against t being a hyperbola. 

If qEtjm 0 is very much less than c, then eqn (5.36) becomes 


( q -\t 

U * l«o/ 


Now, qElm 0 is the non-relativistic value for the acceleration, so 
that eqn (5.36) reduces to the classical case when both the accelera¬ 
tion and the time are small. Expanding eqn (5.37) by the binomial 
theorem one has 


TflyC 


i + k 2 


EH 2 

m 2 c 2 


- 1 


If qEt/m 0 < c, this reduces to 

‘-•(9 


t 2 


This is again in agreement with the non-relativistic value. 

The total work done on the charge by the electric field is given by 


work done = J F x dx = J" qE x dx 
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But E x = —where <f> is the electrostatic potential. Hence, the 
work done on the charge is equal to 


/•* d<j> 

~ q )o ~dx dX ~ = ?r °j° uIes 

where V 0 is the potential difference through which the charge has 
moved. Equating the work done on the charge to the gain in 
the kinetic energy of the charge, 


T = gV 0 (5.39) 

5.4.2. Speed and Kinetic Energy of Relativistic Electrons 

The experimental arrangement used by Bertozzi 2 (1964) to 
measure the speeds of high energy electrons, accelerated by an 
electric field, is shown in Figure 5.4. Bursts of electrons lasting 
about 3 X 10~ 9 sec, were accelerated, by the electric field in a Van 
de Graaff electrostatic generator, up to energies of T5 MeV. The 
electrons were then injected into an electron linear accelerator. 
Between the Van de Graaff and the linear accelerator there was a 
short insulated metal tube which collected some of the electrons in 



Electron burst charge 
distribution as a 
function of time 


Figure 5.4 . Experimental arrangement used by Bertozzi 2 to measure the time of flight of a 
burst of electrons accelerated by a Van de Graaff generator 
(By courtesy of Am. J. Physics) 
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each burst passing through it, giving rise to a voltage pulse which 
signalled the start of the electron burst down the flight path, which 
in this experiment was the metal pipe forming the linear accelerator. 
At the end of the linear accelerator, after a flight path of 8-4 m, the 
electrons were stopped in an aluminium disk. The charging of this 
disk produced a second electrical signal. The two electrical signals, 
one from the beginning and one from the end of the flight path, 
were transmitted along two separate cables of equal length, and 
therefore of equal transit times, to a cathode ray oscilloscope. The 
calibrated sweep of the oscilloscope was triggered earlier by another 
signal transmitted from the beginning of the flight path via a third 
shorter cable. The separation on the oscilloscope trace of the two 
pulses transmitted by the equal length cables gave the time of 
flight of the electrons over a distance of 8-4 m. 

Kinetic energies up to 1-5 MeV were obtained from the Van de 
Graaff without the linear accelerator turned on. To a good 
approximation the linear accelerator served only as an evacuated 
metal enclosure with no accelerating electric fields. If t is the time 
electrons take to travel a distance of 8-4 m, their speed is 8-4 jt. 
The results obtained are tabulated and shown graphically in 
Figure 5.5. The speed measurement at 4-5 MeV was performed 



Figure 5.5. Experimental results obtained by Bertozzi a . The 
solid curve represents the prediction of ( ujc ) a according to 
Newtonian mechanics, ( u/c)‘ = 2Tlm 0 c i . The dashed curve 
represents the prediction of the theory of special relativity: 
( u/c )* = 1 — [m 0 c‘l(T + moC®)]* where m 0 is the rest mass 
of the electron and c = 3 X 10 s m/sec is the speed of light 
in a vacuum. The solid circles are the experimental points. 
They agree with the theory of special relativity, and show that 
c is the maximum speed for electrons 
(By courtesy of Am. J. Physics ) 
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with the Van de Graaff set at 1-5 MeV and with the first section 
of the linear accelerator (about 1 m long) being used to supply 
the additional 3 MeV. In this case the speed of the electrons 
varied over the first metre of the 8-4 m flight path. The results 
presented in Figure 5.5 show that, as the kinetic energy of the 
electrons increases, the speed of the electrons approaches the speed 
of light. The speed of electrons only increased by 14 per cent 
between kinetic energies of 0-5 and 4-5 MeV. In the 15 MeV run, 
the electron energy was increasing continually over most of the flight 
path. However, the time of flight was almost the same as for 
4-5 MeV electrons showing that the speed of the electrons did not 
increase appreciably between 4-5 and 15 MeV. The results of 
Bertozzi show conclusively that the speed of light is the limiting 
speed for electrons. If space is isotropic, the limiting speed of 
electrons should have the same value in all directions of space. 

From eqn (5.19) 


Rearranging, 


me 2 = 


Vl - u 2 Jc 2 


T + mgc 2 


u 2 f rriff 2 ) 2 

c 2 i (r+m oC 2 )j 


(5.40) 


Eqn (5.40) is plotted in Figure 5.5. The kinetic energy T was 
assumed to be equal to J^E . dl where E is the electric field, q the 
charge on an electron and dl is an element of length of the electron 
path. It can be seen that the experimental results of Bertozzi are 
in agreement with the variation of speed with kinetic energy pre¬ 
dicted by the theory of special relativity. 

Using a thermocouple, Bertozzi measured the temperature rise 
and the heat generated in the aluminium disk by the electrons 
when they are stopped at the end of the flight path. From the 
charge collected by the aluminium disk, the total number of 
electrons was estimated. Assuming that all the kinetic energy of 
the electrons goes into heat, the kinetic energy of the electrons was 
determined. Energy measurements were performed at the accel¬ 
erator settings of 1 -5 and 4-5 MeV. The corresponding kinetic 
energy values obtained by the direct thermal technique were T6 
and 4-8 MeV respectively. These values agreed within the experi¬ 
mental error of 10 per cent with the values for kinetic energy 
calculated from JyE . dl. They showed that, though the speeds of 
electrons increased only slightly between the 1-5 and 4-5 MeV 
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settings of the accelerator, the kinetic energies of the electrons 
increased by a factor of 3. 

If the laws of high-speed mechanics are to obey the principle of 
relativity, the analysis of Section 5.4.1 and the experiment of 
Bertozzi 2 should be valid in all inertial frames. For example, 
consider a rocket moving with a velocity +V relative to the earth. 
If the laws of high-speed mechanics obey the principle of relativity, 
according to the analysis of Section 5.4.1, when it is accelerated in 
an electric field, an electron starting from rest relative to the rocket 
should tend to a limiting speed relative to the rocket. If the electric 
field, relative to the earth is parallel to — v such that the acceleration 
of the electron is parallel to +v, the analysis of Section 5.4.1 should 
be valid relative to the earth also. The speed of the electron 
should tend to a limiting value c, relative to the earth. This 
limiting speed, or ultimate speed, relative to the earth, should be 
independent of the speed of the rocket, relative to the earth. 
Hence, the speeds of accelerated electrons should tend to a limiting 
value in all inertial frames. If all inertial frames are equivalent, 
the limiting speed for accelerated electrons should have the same 
numerical value in all inertial frames, provided the same funda¬ 
mental units of length and time are used. For example, if the 
numerical value of the limiting speed of accelerated electrons, 
measured in the rocket moving relative to the earth, depended on 
v the speed of the rocket relative to the earth, one could carry out 
experiments to find out how the limiting speed of accelerated 
electrons relative to the rocket varied with the speed of the rocket 
relative to the earth. Thereafter, by measuring the limiting speed 
of accelerated electrons inside a moving rocket, one could estimate 
the speed of the rocket relative to the earth, without looking at 
anything external to the rocket. This is contrary to the assumption 
that, in inertial reference frames, the laws of mechanics obey the 
principle of relativity. Therefore, if the laws of high-speed mech¬ 
anics are to obey the principle of relativity, we conclude that the 
limiting speed of accelerated electrons is the same, and equal to 
the terrestrial value of c — 3-0 X 10 8 m/s, in all inertial reference 
frames. This result will be called the principle of the constancy of the 
limiting speed of particles. This principle together with the principle 
of relativity can be used to develop the Lorentz transformations in 
the way outlined in Appendix 3(b). By treating light as photons 
travelling at the limiting speed, it will be illustrated in Section 5.8.6 
that the principle of the constancy of the speed of light is a special 
case of the principle of the constancy of the limiting speed of 
particles. 
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5.4.3. Motion in a Magnetic Field (No Electric Field) 

The force on a particle of charge q coulombs moving with 
velocity u m/sec in a magnetic field of strength B Wb/m 2 is equal to 

f=juxB newtons 

where u x B is the vector product of u and B. This force is per¬ 
pendicular to both u and B; consequently the scalar product f .« 
is zero so that eqn (5.15) becomes 

dT d dm 

f ' u= dF = di (m - 7 ”» )c2 = e d7 = 0 


Thus the kinetic energy and relativistic mass of a charged particle 
moving in a magnetic field are constant. Hence the speed and 
momentum of the charged particle are also constant, and 


f = -( mu ) =• 


nzdu udm mdu 


d t + dt 


dt 


= ma 


so that 


du 

f = ma. — m-r- — 


m n 


du 


dt Vl — u % jc 2 dt 


= (/u x B 


(5.41) 


The acceleration is in the same direction as the force in this case. 
The equation of motion is exactly the same as the classical case 
except that m 0 is replaced by m 0 /( 1 — h 2 /c 2 )* which remains constant. 

We shall start by considering motion in a uniform magnetic field. 
Let the initial velocity of the particle be perpendicular to the 
direction of the magnetic field as shown in Figures 5.6(a) and (b). 
Let the magnetic field be in the negative z direction. The accelera¬ 
tion is always perpendicular to u so that the velocity of the particle 
is constant in magnitude. In a uniform magnetic field the accelera¬ 
tion is equal to quBjm and is a constant so that the charged particle 
moves in a circle as shown in Figures 5.6(a) and (b). If the radius 
of the circular motion is p, then the centripetal acceleration is equal 
to u 2 lp as illustrated in Figures 5.6(b) and (c). 

Substituting in eqn (5.41) 


or 


/ = 



mu 2 
P 


quB 


mu = q(Bp) = p 
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Path when U is not perpendicular to B 



Otherwise it is a helix 

(a) (b) 



or 0-#-•£* 

St p 


(C) 


Figure 5.6 . The motion of a positive charged particle in a uniform magnetic field is a circle, 
if the initial velocity of the particle is perpendicular to the magnetic field B. When u is not 
perpendicular to B the path is a helix 


If a charged particle moves in a circle of radius p in a uniform 
magnetic field B then the time for one complete revolution is 
equal to 


277pm 277pm 277 m 

2irp u = —— = —-— = —- 


(5.43) 


If the variation of the mass of the particle with velocity can be 
neglected, this time is independent of the velocity of the particle. 
According to eqn (5.42), the radius of the orbit is proportional to the 
momentum of the particle. 

By measuring the radius of curvature of the orbit of a charged 
particle moving in a direction perpendicular to a uniform magnetic 
field, the momentum of the particle can be determined using eqn 
(5.42). The greater the momentum, the larger the radius of 
curvature. The quantity Bp is called the magnetic rigidity. Con¬ 
sider a particle of charge Zq, where q = 1 -602 X 10~ 19 C is the 
electronic charge. If B is in Wb/m 2 and p is in metres, then from 
eqn (5.42) 

pc = qBpc = Z x 1-602 X 10 ~ 19 Bp X 3 X 10 s J 

But 

1J = 1/1-602 X 10~ 13 MeV 

Hence 

p = 300 ZBp MeV/c 
If B is in gauss and p is in centimetres 

p = 300 ZBp eV/c 
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As an example consider a charged particle of charge equal to an 
electron moving in a direction perpendicular to a uniform magnetic 
field of 1 Wb/m 2 . If the radius of curvature of its orbit is 105 cm, 
calculate its momentum. If its velocity is 0-95c, find its rest mass 
and kinetic energy. 

From eqn (5.44) 

p - 300 Bp = 315 MeV/c 

Now 


cp = 


m 0 uc 


O ^ m ,/: 2 


Hence 


or 


VI - u 2 jc 2 VT - 0-95 2 

3-202 X 0-95moC 2 = 315 MeV 
m 0 c 2 = 103-5 MeV 


3-202 X 0-95moC 2 


This corresponds to a negative particle of mass ~203 times the 
electron rest mass. It is therefore probably a //"-meson. Now 

E 2 = c 2 p 2 + my = 315 2 + 103-5 2 

E = 331-5 MeV 


T = E — m 0 e 2 = 228 MeV 


It is left as an exercise for the reader to repeat the calculation using 
the trigonometrical formulae developed in Section 5.3.4. 

If the velocity of the charged particle is not perpendicular to the 
uniform magnetic field, then the velocity of the particle can be 
resolved into two components, one component un parallel to the 
magnetic field and the other component u ± perpendicular to the 
magnetic field. If there is no electric field present, un is unchanged 
in magnitude and direction. This velocity must be compounded with 
the circular motion perpendicular to the magnetic field. The result¬ 
ing motion is a helix. 

The motion of ions in a non-uniform magnetic field can also be 
calculated from the equation 


d / m 0 n \ 
d^Wl - « 2 /c 2 / 


r/U X B 


(5.46) 


When the magnetic field is not uniform and especially when both 
electric and magnetic fields are present, it is generally more con¬ 
venient to use Lagrange’s equations and start from the relativistic 
Lagrangian. This method is discussed by Rosser 3 Section 9.6 where 
it is shown that, if the Lagrangian for a charged particle of velocity 
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u is defined as 

L = m 0 c 2 ( 1 — Vl — m 2 /c 2 ) -f- ?(w • A) — q<f> (5-47) 

where </> is the scalar potential and A is the vector potential of the 
electromagnetic field, then Lagrange’s equations give the correct 
relativistic equation of motion. 

It is shown by Rosser 8 (Section 9.7) that if the Hamiltonian is 
defined as 

H— q</> + c{m\c 2 + (p — ?A) 2 }* (5.48) 

then Hamilton’s equations give the correct relativistic equation of 
motion for a charged particle. 


5.4.4. Experimental Verification of the Variation of Mass with Velocity 

The ratio ejm for electrons was measured by Thomson in 
1897 using cathode rays. In 1901 it was shown qualitatively by 


y ♦ 


B is downwards 



X X X X X X 

Figure 5.7. Simplified diagram of Bucherer^s experiment . The capacitor 
acts as a velocity selector. After emerging from between the plates of the 
capacitor, the fi-rays are deflected in a magnetic field and detected by a 
photographic plate 


Kaufmann, using the parabola method, that the value of ejm 
depended on the velocity of /J-rays. In 1908 Bucherer 4 carried out 
more accurate measurements of ejm for /3-rays. The principle of 
Bucherer’s experiment is illustrated in Figure 5.7; /8-rays emitted 
from a radium source passed between the plates of a large capacitor. 
The diameters of the plates of the capacitor were very much bigger 
than the distance between the plates. The whole apparatus was 
placed inside a vacuum. A potential difference was applied to the 
plates of the capacitor so as to produce an electric field in the 
negative^ direction as shown in Figure 5.7. The electric force on a 
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negative electron was equal to Ee and acted in the positive y 
direction. A magnetic field was applied in the negative z direction, 
that is, perpendicular to the paper away from the reader, in Figure 
5.7. The magnetic force on an electron moving along the positive 
x axis was equal to euB, and was in the negative.? direction. If the 
electric and magnetic forces acting on such an electron were not 
equal, then an electron moving initially along the x axis would be 
deflected in either the positive or negative.? direction and would not 
emerge from between the plates of the capacitor. If the electric 
and magnetic forces were exactly equal, the electron would not be 
deflected. It is only in this case that an electron would emerge from 
between the plates of the capacitor, if the electron were moving 
initially along the x axis. For this case 


euB = eE 
or 

« = E/B (5.49) 

The capacitor acted as a velocity selector in Bucherer’s experiment. 
There was no electric field outside the capacitor, so that the 
electrons, which emerged from the velocity selector, moved in 
circular orbits in the magnetic field before striking the photographic 
plate, as shown in Figure 5.7. If p is the radius of the circular path 
outside the capacitor plates, 


or 


d{2p - d) = D 2 

D 2 + d 2 
P ~ 2d 


(5.50) 


where the quantities d and D are as shown in Figure 5.7. 
(5.42) for motion in a magnetic field one has 


Thus 


„ mu 

mu — qBp or p = — 

D 2 -f- d 2 mu 
2d = ~Be 


From eqn 


But from eqn (5.49), u = EjB , so that 

e 2d E 

m (D 2 + d 2 ) B 2 ( 5 - 51 ) 

If all the lengths are measured in metres, E in volts per metre and B 
in webers per metre 2 , then e/m is in coulombs per kilogram. Bucherer 
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reversed the electric and magnetic fields and obtained a second 
spot on the photographic plate. Bucherer took d to be equal to half 
the distance between the two spots. The experiment was repeated 
for electrons having different velocities. Some of the results obtained 
by Bucherer are given in Table 5.1. It can be seen that the values 
of e/m = e(l — tt 2 /c 2 )*/m 0 varied with the velocity of the /3-rays; 
the velocities of the /3-rays were calculated using eqn (5.49). The 
calculated values of ejm Q were constant. Similar experiments have 


Table 5.1. Bucherer’s results for ejm for /J-rays 


ujc 

e/m — 

Wl - u a /c* 

«/m 0 


m 0 

SB T flfti 

1-661 

X 10“ C/kg 

1-752 

x 10“ C/kg 

7: 

1-630 

1-761 

Sit£3rBnHI 

1-590 


1-760 


0-5154 

1-511 


1-763 


0-6870 

1-283 


1-767 



been performed many times since, and the results obtained have 
always been consistent with the equation 


m „ 


" = (5 - 52 > 

It is generally accepted that these experiments confirm eqn (5.52) 
for the variation of mass with velocity. However, other postulates 
were used in the theory of the experiment; for example, it was 
assumed that the force on the moving charge was given by the 
Lorentz force and that the electric charge of the particle was 
independent of its velocity (the principle of constant electric charge). 
It is preferable to think of the results of these experiments as one 
illustration of the correctness of the relativistic theory for the 
motion of ions in electric and magnetic fields taken as a whole, 
rather than as a confirmation of one particular part of the theory. 
There is, however, independent evidence in favour of the principle 
of constant charge. If the magnitude of the total charge on a 
particle did vary with the velocity of the particle, for example in a 
way similar to the variation of mass with velocity, then hydrogen 
atoms and molecules would not be electrically neutral, since, on 
the average, the electrons in hydrogen atoms and molecules have 
higher velocities, relative to the laboratory, than the protons. 
King 5 (1960) showed that the charges on the electrons and the protons 
in hydrogen molecules are numerically equal to within 1 part in 
10 20 . King also showed that the positive protons in helium nuclei 
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cancelled the charge of the orbital electrons with nearly the same 
experimental accuracy. The principle of constant electric charge 
is implicit in Maxwell’s equations. (Rosser®, Section 3.3.) 

5.4.5. The Acceleration of Charged Particles to High Energies 

This account is not intended to be an exhaustive account of 
particle accelerators, but merely a brief outline of some of the 



Figure 5.8. A simplified diagram of the cyclotron. The path of 
a typical ion is shown 

physical principles underlying the acceleration of particles in the 
energy range where the variation of mass with velocity becomes 
important. For a detailed account of accelerators the reader is 
referred to McMillan 6 where references to other works are given. 

The cyclotron is shown in schematic form in Figure 5.8. It 
consists of two flat hollow conducting semi-circular boxes (which are 
generally called dees). The two dees are placed between the poles 
of an electromagnet and are inside a high vacuum. Ions are pro¬ 
duced from the source S which is in the centre between the two dees. 
A potential difference is applied between the dees such that there 

139 


~PuJuc. Tfeat/tewtafiea/ P/t yltc. 1 . 





RELATIVISTIC MECHANICS 


is an electric field in the gap between them. The ions are accelerated 
from the ion source towards one of the dees. As the dees are hollow 
conductors the electric field inside them is negligible, so that once 
they are inside one of the dees, the velocities of the ions remain 
constant and the ions move in circular orbits inside the dees. After 
going through a semi-circle, the ions reach the gap between the 
two dees. If by this time the potential difference and the electric 
field between the dees are reversed in direction, the ions are 
accelerated again and move in a semi-circle of larger radius inside 
the other dee before reaching the gap again. According to eqn 
(5.43) the time for one complete revolution in a uniform magnetic 
field is equal to 2ttp/u = 2nmjqB, so that the time for half a revolu¬ 
tion is equal to 7 rm/Bq. This time is independent of the energy of 
the particles, provided the variation of the mass of the particles with 
velocity can be neglected. If the electric field between the dees 
reverses in direction at regular intervals of irm/Bq, then the ions 
will be accelerated each time they cross the gap between the dees 
gaining energy each time. The radius of the orbit of the ions is 
given by 

mu 

P = Jb (5-42) 


As the velocity of the ions increases, then the ions go in semi-circles 
of ever increasing diameter. The alternating potential difference 
can be obtained from a high frequency oscillator. The condition for 
resonance is 


qB qB \/1 — u 2 jc 2 
2nm 2 t rm 0 


(5.53) 


where n is the frequency of the oscillator. 

Provided the variation of mass with velocity can be neglected 
the time of revolution of the ions in a uniform magnetic field 
remains constant and the ions can be accelerated by a fixed fre¬ 
quency oscillation, of frequency given by eqn (5.53). For protons, 
if n = 10 7 c/sec then the magnetic field must be 0-65 Wb/m 2 
(6,500 G). 

At high energies the mass of the ions increases with velocity and 
the time of revolution increases. If the frequency n remains fixed, 
the ions will get out of phase. The variation of mass with velocity 
is small enough for protons below 40 MeV for them to be accelerated 
without getting completely out of phase. It is not possible to 
accelerate electrons beyond ~50 keV before they get out of phase. 
In principle the loss of resonance can be overcome in two ways. 
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As the energy of the ions increases the relativistic mass increases and 
the resonant frequency goes down. If the frequency of the applied 
potential difference is modulated appropriately, then it should be 
possible to accelerate ions to higher energies. One of the first 
frequency-modulated cyclotrons (or synchrocyclotrons) was the 
184-inch machine developed at the University of California. This 
produced protons of energies up to 350 MeV, deuterons of energies 
up to 195 MeV and a-particles of energies up to 390 MeV. For 
protons the frequency had to be changed from 22-9 Mc/sec at the 
instant of injection to 15-8 Mc/sec when the protons reached the 
outside of the dees. The corresponding frequencies for deuterons 
and a-particles were 11-5 and 9-8 Mc/sec. The synchrocyclotron 
is not suitable for accelerating electrons as the rate of modulation 
required would be too high. 

The other method of overcoming loss of resonance is to increase 
the magnetic field, after the ions have been injected, at such a rate 
that the ratio B/m = .6(1 — a 2 /c 2 )*/m 0 remains constant. The 
frequency need not then be changed. This method has been applied 
successfully to the acceleration of electrons in the electron synchro¬ 
tron. The electrons are injected at energies of a few tens of keV 
and are accelerated initially by betatron action up to a few MeV, 
when the synchrotron action takes over and the electrons are finally 
accelerated up to 300-400 MeV. One of the advantages of the elec¬ 
tron synchrotron is that, at 2 MeV, electrons move with practically 
the speed of light (~0-98 c). If B/m is a constant, the radii of 
the orbits given by eqn (5.42) do not increase very much as the 
energies of the electrons increase, since above 2 MeV the velocities of 
the electrons remain practically constant. For this reason, one need 
only use a doughnut-shaped vacuum chamber and the magnetic 
field need only extend over the dimensions of this vacuum chamber. 
Protons do not approach the speed of light until they have an 
energy of a few GeV; for example, protons do not reach the same 
speed as 2 MeV electrons until their energy is ~4 GeV. If rti/B 
were kept constant in a proton synchrotron, such that a constant 
frequency could be used, then the radii of the proton orbits would 
increase substantially as the protons were accelerated from a few 
MeV up to a few GeV. This would necessitate building an enormous 
magnet. In order to keep the protons in orbits of fixed radii both 
the frequency of the potential difference and the strength of the 
magnetic field are varied. The C.E.R.N. proton synchrotron can 
accelerate protons up to an energy of 25 GeV. 

When electrons move in circular orbits they emit photons 
(synchrotron radiation) due to their centripetal acceleration, and 
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this becomes very important at energies above 300 MeV. Electrons 
have been accelerated to energies greater than 600 MeV in the 
Stanford linear electron accelerator. In this accelerator the 
electrons are accelerated by the electric vector of a travelling 
electromagnetic wave. The linear electron accelerator is designed 
such that the speed of the electromagnetic wave increases from the 
speed of the electrons at injection towards the speed of light. When 
the electron energy is greater than 600 MeV its relativistic mass is 
greater than 1,200 times the electron rest mass. 

The design of high energy accelerators depends on the equations 
of the relativistic dynamics of charged particles developed in 
Section 5.3. The successful design of these accelerators shows that 
the theory developed, in which it is assumed that the inertial mass 
varies with velocity, that the electric charge is constant and that 
particle velocities are less than the velocity of light, is satisfactory 
up to very high energies. 


5.5. THE TRANSFORMATION OF THE MOMENTUM 
OF A PARTICLE AND THE FORCE ACTING ON 
A PARTICLE 

In Sections 5.3 and 5.4 the motion of charged particles in electric 
and magnetic fields was discussed and it was shown that when mass, 
momentum and force are redefined as described in Section 5.3, 
then the new theory is able to predict the motion of charged particles 
up to extremely high energies. The equations were used in the 
laboratory system. It is sometimes more convenient to carry out 
the calculations in an inertial frame which is moving relative to the 
laboratory, such as, for example, the centre of mass system. 


5.5.1. Transformation of Momentum and Energy 

In the inertial frame £ the momentum of the particle is defined 
by the relations 


m n u. 


mu - = 


Vl - u 2 lc*' Pv Vl - u*lc*’ 

and the total energy is defined by the relation 

A 


m n u, 


o u v 


m n u, 


Vl — u 2 /c 2 


£== 


A — 




Vl — u 2 jc z 


The corresponding quantities in £’ are defined as 


Px — m u x — -y/]~ 


u’ 2 jc 


Pv = 


l o u v 


m„u , 


Vl - u’ 2 /c 2 ’ lz Vl - u 2 jc 2 
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and 


E' = m'c 2 


\ 1 - a' 2 /c 2 


Substituting for 1/(1 — a' 2 /^ 2 )* from eqn (4.14) and substituting 
u x = ( u x — v)l( 1 — vujc 2 ) [eqn (4.5)] into the expression for p' x , 
one obtains 

, _ m o( u * ~ v ) (! - vujc 2 ) 

Px (1 - vujc 2 ) Vl _ pi/eVi _ a 2 /c 2 

_ ffl o (“»~”) 

Vl — a 2 /c 2 V1 — a 2 /c 2 

= y{wu x — mv) 


But mu x = p x and m = E/c 2 

Hence p' x = y(p x — vE/c 2 ) (5.54) 


Substituting for 1/(1 — a' 2 /^)* from eqn (4.14) and substituting 

— « P /V into the expression for A™ one obtains 

Vujc 2 ) 

m 0 (l — vujc 2 ) u y V 1 — e 2 /c 2 
vujc 2 ) 


“»(T 

Py = 


Vl - u' 2 jc' 


'2/,2 “ yT 


v 2 jc 2 V 1 - a 2 /c 2 (1 


m a u v 

Vl - u 2 jc 2 


Pv 


that is 

Similarly 

Now 



£' = m 'c* = 


m 0 (! — vujc 2 ) c 2 


V 1 — a' 2 /e 2 Vl - v 2 /c 2 V 1 — a 2 /c 2 


(5.55) 

(5.56) 


= y(£ - 

Hence 
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Similarly one has the inverse relations 


/ . vE'\ 

(5.59) 

Px = y\Px + -p) 

> 

II 

(5.60) 

Pz=P'z 

(5.61) 

E = y{E' + vp' x ) 

(5.62) 


It will be seen that p’ x , p' v , p' % and E'/c 2 transform in the same way as 
the space and time co-ordinates x', y', z', t'. For example, corre¬ 
sponding to eqn (5.54), x' = y(x — vt), and corresponding to eqn 
(5.58), t' = y{t — vx/c 2 ) 

Now 

p’* - E' 2 \c 2 = p' 2 + p' 2 + p'z - E’*lc* 

= y 2 (p x - vE/c 2 ) 2 + pi + p 2 - yHE - v Px ) 2 lc 2 
= Pl + Pi + P 2 , ~ E 2 /c 2 

= p 2 - E 2 jc 2 (5.63) 


Hence, the quantity p 2 — E 2 jc 2 is an invariant. It follows from 
eqn (5.22) that its numerical value is equal to — m 2 c 2 . 

A system of two colliding particles will now be considered. Let 
particle 1 have constant momentum p x and total energy E l and 
let particle 2 have constant momentum p 2 and total energy E 2 
relative to 27 before the collision. Let the corresponding values 
relative to 27' be Pi, K, p 2 and E’ 2 respectively. 

Let 

E = E x + E 2 ; £'=£( + E' 2 (5.64) 

and 

p = Pi + P2; p' = p! + P2 ( 5 - 65 ) 

Using eqn (5.54) 

(P'i)x + ( Pi)x = v\.(Pi)x - vEJc 2 ] + y[(p 2 ) x - vEJc 2 ] 

— y\_{Pi)x + iPz)x — v (Ei + e 2 )/c 2 ] 

or 

p’x = yVPx - vE/c 2 ] (5.66) 

Similarly, 

P'v = Py'> P'z = Pz, E ' = PIE - D Pxi ( 5 - 67 ) 


Hence, the sum of the momenta of the two particles and the sum 
of the total energies of the two particles before the collision transform 
in the same way as the momentum and total energy of a single 
particle. By repeating the argument leading to eqn (5.63), we have 
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for the sum of the momenta and the sum of the total energies of the 
two particles before collision 


P'2 _ = p2 _ £2 /c 2 (5 6 g) 

5.5.2. The Transformation of Force 

In the inertial frame £ the force acting on a particle is defined as 
a vector 

, . f = dp/dt (5.69) 

having components 

f x = dpjdt etc. (5.70) 

Similarly, in the inertial frame £', the force acting on the particle 
is defined as a vector having components 


fx = dp'Jdt' etc. 

Substituting for p' x from eqn. (5.54) into eqn. (5.71) 


Now, 


and 


,, d/ vE\ dt d/ vE\ 

fx y ) -? d f dt!/’* ~ c 2 ) 


l 


i 


l 


dt 

dt' dt'jdt d y( 1 — vu/c 2 ) 

y(t - vx/c 2 ) 


Using eqn (5.15) 


d Px! dt = fx 


(5.71) 

(5.72) 

(5.73) 


dEd d T 

dt ~ dt^ OT °^ = "d7 = f * u ~f°F x +fv u v +L u z 

Substituting in eqn (5.72) and rearranging, we obtain 


f* = fx - 


VU„ 


Since, 


(c 2 - vu x ) 

Py=P: 


fv- 


f' — _^pv — dt d p' 
Jv ~ dt' 


dt' 


vu z r 

(c 2 — vu x ) * 

(5.74) 


(5.55) 

\ dt 


f = d?^ 



Using eqn. (5.73), we obtain 


cVl - v 2 /c 2 r 
(c*-vu x ) 
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Similarly, _ 


cVl - 0 2 /c 2 

Jz ~ (c* - vuj Jz 

(5.76) 

relations are 


f vu 'y f + vu ' z f 

Jx-Jx-i- ^ + vu r ) J V 1 ( C 2 + ml ' x )J* 

(5.77) 

cW 1 - f 2 /* 2 , 

Jv ~ {c* + vui) Jv 

(5.78) 

cW 1 — » 2 /c 2 , 

(c* + vi 4) Jz 

(5.79) 


All the transformations derived in this section relate to the 
instantaneous values of the mass, energy and momentum of the 
particle and the force acting on the particle at some identifiable 
point on the path of the particle, the co-ordinates and time of this 
point in E and E' being related by the Lorentz transformations. 


5.6. THE THEORY OF ELASTIC COLLISIONS 

It was from a consideration of the collision of two elastic spheres that 
one was led to redefine the inertial mass of a moving particle to be 
equal to m 0 /( 1 — m 2 /^ 2 )*, where m 0 is the rest mass and u is the 
velocity of the particle. This led one in turn to redefine momen¬ 
tum and force, and using these new definitions it was possible to 
formulate a satisfactory theory for the relativistic dynamics of a 
single particle. The case of the elastic collision of two particles is 
now considered in more detail. By elastic collision is meant a 
collision in which, apart from the rest mass energies of the particles, 
all the energy is in the form of kinetic energy both before and after 
the collision. It will be postulated that in elastic collisions both 
linear momentum and kinetic energy are conserved. If it is assumed 
that the particles do not change their identity during the collision, 
then the rest masses of the colliding particles are the same before 
and after the collision. If the total kinetic energy is conserved in 
the collision then, since for each particle E — T + m 0 c 2 , the sum 
of the total energies of the two particles is also conserved in the 
collision. 

Let a particle of rest mass m 0 move along the x axis with velocity 
u 1; momentum p 1 and total energy and let it collide with a par¬ 
ticle of rest mass m 0 , which is at rest in the laboratory system. The 
laboratory system will be referred to subsequently as the lab.-system 

146 


~Pufuc. 7feaf/tcj«trttica/ 



THE THEORY OF ELASTIC COLLISIONS 


and denoted by E, as shown in Figure 5.9(a). The collision will also 
be considered from an inertial frame E', which is moving with 
velocity v relative to E along the common x axis, such that the total 
momentum of the two particles is zero in E’ before collision. This 
system is generally called the centre of mass system and denoted 
the c.m. system, though the term centre of momentum system 



Figure 5.9. The elastic collision between two particles having equal rest masses. Particle 2 
is at rest in £ before collision, whilst particle 1 moves with velocity u, relative to 2 before 
collision. In the c.m.-system £', the particles approach each other with the same velocity v 
before collision and rebound with equal and opposite velocities v after collision as shown in (i) 


is sometimes used. In the lab.-system the total momentum before 
collision is given by 

P = Pi + 0 = Pi 

and the total energy before collision is given by 


E — E 1 -\- E 2 — 


m n c* 


V1 — u\jc 


+ rrttf? 


m,,c‘ 


° IVl - af/c 2 


+ 1 


Using eqn (5.66) for the transformation of the sum of the momenta 
of the colliding particles from E to E' one has 

fix = Ylfix - vE/c*) 

Similarly, p' y = p z — 0. 

For the total momentum to be zero in E', one must have 


p x — vEjc 2 , where E = E 1 + E 2 
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or 


that is 


v 



m^u^c 2 

Vl — af/c 2 


m 0 c 2 


2 

1 


V1 — u\jc 2 



p x c 2 __ 

■E (1 + Vl — af/c 2 } 


(5.80) 


(5.81) 


This is the velocity of the c.m.-system relative to the laboratory. If 
the x component of total momentum and the total energy in the 
lab.-system are conserved in the collision, then the c.m.-system 
must continue to move with the same velocity v relative to the 
lab.-system after the collision. Since, in the c.m.-system, the total 
momentum is zero, before collision, we have 


Pi + Ps = 0 or Pi = “Pi 

It is being assumed that the rest masses of the two colliding particles 
are equal so that, if their momenta are numerically equal in Z' 
before collision, one must have u( = — u^, as shown in Figure 5.9(b). 
Since particle 2 is at rest in the lab.-system (Z) before collision, it 
must move along the negative O'x' direction with velocity v relative 
to Z' before collision as shown in Figure 5.9(b). Thus in the c.m.- 
system the two particles (which have equal rest masses) approach 
each other with equal speeds v before the collision. In the c.m.- 
system the sum of the total energies of the particles before collision 
is equal to 2m 0 c 2 /(l — v 2 /c 2 ) i . If momentum is conserved in the 
collision in the c.m.-system, then the total momentum after the 
collision must be zero so that the particles must rebound with 
equal and opposite velocities. If energy is also conserved in the 
c.m.-system each particle must have a total energy equal to 
m 0 c 2 /( 1 — v 2 /c 2 ) i after collision, so that they must both rebound 
with velocity v. However, they may rebound in different directions 
in the c.m.-system after collision, compared with their directions 
before collision. Let particle 1 be scattered such that it makes an 
angle a with the O'x' axis in the c.m.-system after collision as shown 
in Figure 5.9(b). Choose the orientation of they axis such that after 
collision particle 1 is in the x'y' plane, that is (p' a ) z = 0. It follows 
from the conservation of momentum that (p' a ) z + (p' 1 ) z = p' z = 0 in 
the c.m. system, so that (p' t ) z must also be zero and particle 2 must 
also rebound in the x'y' plane in Z'. The components of the velocity 
of particle 1 in Z' after the collision are 

(«s)® = v cos oc; (u' a ) v = v sin a; (*/')„ = 0 
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Transforming to S (the lab.-system) using the velocity transfor¬ 
mations, we have 


(“ 3 )* = 

(*4)® + y 

z» cos a + 0 

(1 + "“?■) 

[1 + (f 2 /c 2 ) cos a] 

(“ 3 ), = 

(«i).Vl - v 2 jc 2 

0 sin a V1 — n 2 /c 2 


1 + (v 2 /c 2 ) cos a 


From Figure 5.9(a) it can be seen that 


tan 0 


(a 3 )„ sin k V 1 — v*jc 


Mr (1 + cos a) 

For particle 2 in S', after collision, 

(»dx = c o s ( u i)v — sin *; K)* = 0 
Using the velocity transformations, 

-v cos « + v —v sin aV 1 — » 2 /c 2 


(5.82) 


i _ (oi/e*) cos 

Wi = 0 
so that 


a > ( u i)v J 


(» 2 /c 2 ) cos a 


tan <f> = 


~Mv sin a Vi - g8 / c2 


(««)* 0 - cos a) 

Multiplying eqn (5.82) by eqn (5.83), 

tan 0 tan <£ = ^ J L. ( C J = (1 - » 2 /< 2 ) 


(5.83) 


(1 — cos 2 a) 


Now from eqn (5.81) 


v = 


«i 


0 + V1 — a 2 /c 2 } 
i _ v *ic* = i - ^ + 

Hence, 


(5.84) 


1 + 


Vl - u 2 Jc 2 


tan 0 tan <f> = 


< 1 


1 +' 


If zq < c, then 


Vl — af/c 2 


Vl - af/c 2 
1 and tan 0 tan <j> — 1. 
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Now 


tan (0 + <f >) 


tan 0 -f- tan <f> 

I — tan 0 tan <f> 


so that if u x c, tan (0 + <f>) becomes infinite, and 


(5.85) 


e + <f> = l 

This is the expression given by Newtonian mechanics for the angle 
between two particles of equal rest mass after collision. It was 
shown by Blackett and Champion 7 that when a particles from a 
polonium source, for which ujc 1, were scattered in a helium 
filled cloud chamber the angle between the a-particles after 
collision was equal to tt/2. 

If « x is finite then tan 0 tan if> is less than unity, so that in eqn 
(5.85) tan (0 + <f>) is finite and (0 + <f>) is less than 7r/2. Champion 8 
investigated the scattering of electrons, having velocities in the range 
0-82c to 0-94c, by electrons in a cloud chamber filled with either 
nitrogen or oxygen. The momenta of the electrons before and after 
collision were determined from their curvatures in a magnetic field. 
Champion found that (0 + <f>) was less than 7r/2, generally by about 
10 degrees or more. The results could not be interpreted as elastic 
collisions between electrons on the basis of Newtonian mechanics, 
but the observed values for (0 + <f>) were in very good agreement 
with the predictions of the theory of special relativity for elastic 
collisions between electrons. Champion showed that the variation 
of (0 + <f>) from 73-/2 was in better agreement with the theory of 
special relativity than with the formula for the variation of mass 
with velocity suggested by Abraham (1903). 

The theory of special relativity says nothing about the probability 
that the two electrons collide or the probability of obtaining various 
scattering angles, but merely gives relations between the momenta 
of the particles after collision. For a more complete theory of the 
scattering process one would have to use relativistic quantum 
mechanics. It must be pointed out that, in our calculations, it was 
assumed that one could identify particles 1 and 2 before and after 
collision. According to quantum mechanics this is not possible, if 
both the particles are electrons. 

The solution of the general case of the elastic collision between 
two particles was given by Juttner 9 in 1914. Let the momenta of 
the colliding particles be p x and p 2 = 0 in the lab.-system before 
collision. Let their rest masses be m 0 and M 0 respectively and let 
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Figure 5.10. The elastic collision between a particle of rest mass 
m,, moving with velocity u t and a stationary particle of rest mass M„ 


their total energies before collision be E l and E % respectively. Let 
particle 1 have momentum p 3 after the collision and be scattered 
through an angle 0 in the lab.-system as shown in Figure 5.10. Let 
the struck particle rebound at an angle c/> with momentum p 4 in 
the lab.-system. According to the law of conservation of momentum 

Pi + 0 = p 3 + P 4 
Writing this out into components, 

Pi = P 3 cos 0 + Pi cos <j> (5.86) 

0 = Pz sin 6 — pi sin 0 (5.87) 

The law of conservation of energy gives 

•Ei + E z = E 3 + E 4 

that is 

cVp\ + + M 0 c 2 = cVp\ + mj;c 2 + cVpl + M 2 c 2 (5.88) 

If it is assumed that m 0 , p 1; M 0 , p 2 = 0 are given then there are 

four unknowns p 3 , p 4 , 6 and <f> in the three equations (5.86), 
(5.87) and (5.88). One of the four unknowns must be given if 
the problem is to be solved. For example, if the scattering angle 6 
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is given, then after some lengthy algebra one obtains: 

{ (m 2 c 2 + MqEj) cos 6 + (E 1 + M 0 c 2 )V M\ — m* sin 2 0 
Pa = Pi I Jp \2 

l (y + M o c J ~ Pi cos2 6 


(E x + M 0 c 2 )(m§c 2 + M 0 Ej) + c 2 p\ cos OVM\ — w 2 sin 2 6 


+ Af 0 cj — p\ cos 2 0 


' ' > (5.90) 

Pl=P\ J r Pi ~ 2 PiPs cos 6 (5-91) 

Alternatively, the energies and momenta after collision can be 
expressed in terms of <j>, the angle of recoil. One then has 
P-fiM 0 (E 1 + M 0 c 2 ) cos <f> 


r * “ /f \2 

M + M 0 cj - p\ cos 2 4 > 

2pf cos 2 (f>M 0 c 2 

E, = M 0 c* + , ^ -2 - 

cos 2 

Pi = Pi + Pi- 2 PiPi cos ^ 


(5.93) 


A comprehensive discussion of eqns (5.89)-(5.94) is given by 
Janossy 10 . 

The elastic scattering of two particles of unequal rest masses can 
also be discussed in the c.m.-system. The approach is similar to that 
adopted when discussing the example of the elastic collision of two 
particles of equal mass. The momenta of the colliding particles 
are again equal in the c.m.-system, but, since the rest masses of the 
particles differ, they do not approach each other with equal speeds 
in the c.m.-system. If the collision is elastic, the particles rebound 
with the same numerical values of momenta as they had before 
collision. The angle a the particles make in the c.m.-system with 
the direction of motion of the c.m.-system relative to the lab.- 
system is again a variable parameter. Experimental data on the 
elastic scattering of nuclear particles are often presented in terms of 
the probability of scattering between angles a and a + da in the 
c.m.-system. The momenta and energies in the c.m.-system can be 
related to the lab. values using the momentum and energy trans¬ 
formations [eqns (5.59)-(5.62)]. For low energy collisions (< 20 
MeV or so for nucleons, i.e. protons and neutrons) Newtonian 
mechanics and the Galilean transformations are sufficiently accurate. 
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5.7. INELASTIC COLLISIONS AND THE EQUIVALENCE 
OF MASS AND ENERGY 

In Section 5.6 elastic collisions were considered in which, apart 
from the rest mass energy, all the energy before and after collision 
was in the form of kinetic energy. In Newtonian mechanics, 
kinetic energy is not always conserved in collisions, but sometimes 
some of the kinetic energy is converted into heat and sound, whilst 
in other cases some of the kinetic energy may be transformed into 
potential energy. The law of conservation of momentum is always 


Before collision 

„ L/i 1/2 


After collision 


0 — 


Moves with velocity 
v —^-relative to J?) 


Before collision 




After collision 


the composite ball 
is at rest in Z' 


(a) (b) 

Figure 5.11. The two spheres approach each o ther with equal speeds u’ 
in S' and are brought to rest relative to S’ by the collision as shown in 
[(6). The composite ball moves with uniform velocity v relative to S as 
shown in ( a ) 

valid within the limitations of Newtonian mechanics, and the law 
of conservation of energy also holds provided all forms of energy are 
included. It is now seen if the laws of conservation of energy and 
of momentum are applicable in the general case of the collisions 
of high energy particles when kinetic energy is not necessarily 
conserved. To start with, the example of a completely inelastic 
collision is considered, in which, in the inertial frame S', there is 
no kinetic energy after collision. 

Let two spheres, which have the same rest mass m 0 , move with 
velocity f-u and —a' respectively parallel to the x axis of the 
inertial frame S' and let them collide head-on as shown in Figure 
5.11(b). Let the spheres stick together and form one composite 
ball. Since the total momentum before collision is zero, then, if 
momentum is conserved in the collision, the composite ball must be 
at rest in S' after collision. The sum of the energies of the spheres 
before collision in S' is equal to 

9«J /*2 

E' = 2m'c* = . 0 = 2m 0 c 2 + T' (5.95) 

Vl - u'*lc* ° ' 
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where T' is the sum of the kinetic energies of the colliding particles 
before collision. If it is assumed that all the kinetic energy in E’ 
goes into heat, then the heat produced in the collision, measured in 
E', is equal to 

Q' = T' = E' — 2m 0 e 2 (5.96) 

Now consider the same collision from the inertial frame E, where 
E moves with uniform velocity — v along the x' axis relative to E' 
as shown in Figure 5.11 {a). The total momentum of the two spheres 
in E can be calculated from the total momentum and energy of 
the two spheres in E' just before collision, using the inverse of 
eqn (5.66). One has 

Px = y(p' x + vE'jc 2 ) = 

since the total momentum in E' is zero. Substituting for E' from 
eqn (5.96) one has, if Q' = T', 

P x = Y v [ 2m 0 + = yv(2m 0 + Q'/c 2 ) (5.97) 

Similarly, from eqn (5.67) 

Pv=Px = ° (5-98) 

Similarly, the total energy in E before collision can be calculated 
using the inverse of eqn (5.67). One has 

E = y(E' + vp' x ) = y{E' + 0) = yE' 

Substituting for E' from eqn (5.96), one obtains for the total energy 
in E before collision 

E = y(2m 0 c i + T') (5.99) 

After the collision the composite ball, which is at rest in E', moves 
with velocity v in E. If no matter is destroyed in the collision, then 
one might expect that the rest mass of the composite ball in E might 
be equal to 2 m 0 , in which case the total linear momentum in E 
after collision would be equal to 

2 m 0 v 

Px - V / 1 _ pp = 2ym 0 v (5.100) 

Comparing this with eqn (5.97), it can be seen that if the rest mass 
were 2m 0 , then momentum would not be conserved in the collision. 
In order to conserve momentum in E, the quantity 2 m 0 in eqn 
(5.100) would have to be replaced by M 0 = (2 m 0 + Q'jc 2 ). Thus 


~PufLC. P/ tyl t cl 


SOME EXAMPLES OF EQUIVALENCE OF MASS AND ENERGY 

a linear momentum equal to Q'v/[c 2 ( 1 — » 2 /c 2 ) i ] would have to be 
attributed to the heat generated in the collision, if momentum is 
to be conserved in 27. This heat moves with the composite ball with 
velocity v in 27 and its momentum would be equivalent to that of an 
inertial mass equal to Q’/[c 2 (l — v 2 /c 2 )*] in 27, and an inertial mass 
of Q'/c 2 in 27'. The total energy of the composite ball in 27 after 
collision would then be equal to M 0 c 2 j{\ — v 2 /c 2 ) i = y(2m 0 c 2 + Q’). 
Comparing with eqn (5.99) it will be seen that the total energy 
would also be conserved, since it is being assumed that T' = Q' in 27'. 

If the spheres were perfectly elastic, they would be brought to 
rest momentarily in 27' and the kinetic energy T' would be con¬ 
verted into elastic energy. If momentum is to be conserved during 
the collision in 27, then an inertial mass and a momentum would 
have to be attributed to this elastic energy. In a perfectly elastic 
collision the spheres would then rebound under the influence of the 
elastic forces developed such that they would go back along their 
original paths with velocities —u and +a' relative to 27'. 

It has been shown that if momentum is to be conserved in 
‘point’ collisions, then according to the theory of special relativity 
inertial mass and momentum must be attributed to all the forms of 
energy considered, such that if the energy is A E, then the inertial 
mass associated with it is Am = AE/c 2 . If this is correct for all 
forms of energy, then, when the energy of a system changes, the 
inertial mass of the system should also change. Conversely, there 
should be an energy change associated with a change of inertial 
mass. It is consistent with the theory of special relativity to extend 
this principle to apply to the rest masses of particles and to postulate 
the equivalence of mass and energy in all cases. Whether this 
equivalence holds in practice is, of course, a point which must be 
checked by experiment. Some examples of the equivalence of mass 
and energy will now be considered. 

5.8. SOME EXAMPLES OF THE EQUIVALENCE OF 
MASS AND ENERGY 

5.8.1. The Binding Energy of the Nucleus* 

The protons and neutrons making up the nuclei of atoms are held 
together by attractive forces. Work must generally be done to 
remove protons and neutrons from a stable nucleus, so that the 
potential energy of the particles inside the nucleus must be negative 
and this negative potential energy should have ‘negative inertial 
mass’ associated with it, that is, one would expect the inertial mass 

* The masses quoted are all based on the 19 0 scale. 
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of a nucleus to be less than the sum of the rest masses of the indi¬ 
vidual protons and neutrons making up the nucleus. This is 
confirmed by the values of the atomic weights of isotopes determined 
experimentally using mass spectrographs. The mass defect is 
defined as 

Am = Zm H + (A — Z)m n — M Z . A 

where m H = 1-008142 a.m.u. is the mass of a hydrogen atom, 
m n — 1 -008982 a.m.u,. is the mass of a free neutron and M ZA is 



0 50 100 150 200 250 

Mass number 

Figure 5.12. The mean binding energy per particle for the most 
stable nucleus at each mass number 

the atomic weight of an atom containing Z protons and A — Z 
neutrons. The mass defect is due to the binding energy of the 
nucleus. If the formula A E = Amt; 2 is correct, then the total binding 
energy of the nucleus is given by 

A£(MeV) = 931[1-008142Z + 1-008982 (A - Z) - M ZiA ] 

using the fact that 1 a.m.u. is equivalent to 931 MeV. The average 
binding energy per proton or neutron can be obtained by dividing 
by the mass number A. The results for various nuclei are shown in 
Figure 5.12. The binding energy is negative. 

The first experimental verification of the relation A E = A me 2 
was obtained by Cockroft and Walton 11 who accelerated protons 
up to an energy of 0-25 MeV. Cockroft and Walton observed that 
when these protons were incident upon a lithium target, two 
a-particles were sometimes emitted, the a-particles going in opposite 
directions. From the range of the a-particles, Cockroft and Walton 
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estimated that the kinetic energy of each a-particle was 8-6 MeV. 
The gain in kinetic energy in the reaction was equal to 

(2 X 8-6 - 0-25) MeV = 16-95 MeV 

where the 0-25 MeV represents the kinetic energy of the incident 
proton. Cockroft and Walton suggested that the reaction taking 
place was 

lU + JH — *Be -► |He + |He 

Using the values of atomic masses available at the time Cockroft 
and Walton estimated that 

|Li + JH - 2|He = (0-0154 ± 0-003) a.m.u. 

This mass difference was equivalent to an energy difference of 
(14-3 ± 2-7) MeV if the relation A E — Amc 2 were correct. This 
was in reasonable agreement with the observed increase of kinetic 
energy of 16-95 MeV. The agreement is improved if more recent 
values for the atomic masses are used. Many similar experiments 
have been performed, and in all cases it is possible to estimate the 
energy released or absorbed in a nuclear reaction from the difference 
between the atomic masses of the particles before and after the 
reaction, using the relation A E = A me 2 . 

When protons and neutrons are brought together to form nuclei, 
since the binding energy is negative, energy must be released; for 
example, if one could ‘mix’ two protons with two neutrons to form 
a helium nucleus the loss of mass would be 2JH + 2jn — |He = 
0-030 a.m.u., and an equivalent amount of energy equal to 28 
MeV would be released. This type of nuclear reaction is called a 
fusion or a thermonuclear reaction. It is believed that thermo¬ 
nuclear reactions are important in energy production in stars. It is 
believed that there are two main cycles of nuclear reactions, namely 
the proton—proton chain which predominates in small stars, less 
massive than the sun, and the C, N, O cycle which predominates in 
larger stars (Gamow and Critchfield 12 ). The net result, in each case, 
is to produce a helium nucleus from four protons with a release of 
26-7 MeV of energy per helium nucleus formed. 

From the energy reaching the earth from the sun it is possible to 
estimate the total energy radiated by the sun. The amount radiated 
by the sun is equal to 4 x 10 26 J/sec. Hence 
A E 4 x 10 28 

Am = —g- = ——r-jj ~ 4-4 X 10* kg ~ 4-5 X 10® tons 
c y x iu 

Thus the mass of the sun decreases by over four million tons per 
second. This is small compared with the total mass of the sun which 
is 1-98 X 10 30 kg or 2-1 x 10 27 tons. 
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Another important type of nuclear reaction is fission. It will be 
seen from Figure 5.12 that the average binding energy per nucleon 
(proton or neutron) in the nucleus decreases at very high values of 
atomic weights. The protons and neutrons are not as tightly bound 
in very large nuclei such as uranium as in medium sized nuclei 
of mass number ~100. If a nucleus, such as uranium, were split 
in two, then since the protons and neutrons would be more tightly 
bound in the resulting nuclei, energy should be released. A 
typical example of the fission of a 236 U nucleus is 

2 !!U + £n -> 2 |!U -> ““Xe + ||Sr + 2jn + y + 200 MeV 

5.8.2. Kinetic Energy 

It was shown in Section 5.3 that the inertial mass of a particle 
changed when its kinetic energy was increased. This led to the 
variation of inertial mass with velocity, a result which proved to be 
consistent with experiments (cf. Section 5.4.4). 

5.8.3. Electromagnetic Radiation (Light Quanta ) 

On classical electromagnetic theory, light is considered as a form 
of wave motion and it is assumed that the flux of energy in the waves 
is given by the Poynting vector S, which is related to the electric 
intensity E and the magnetizing force H by the relation 

S = E x H 

It is assumed that the Poynting vector represents the amount of 
energy crossing unit area perpendicular to the vector S per second. 
According to the ideas developed in Section 5.7 a momentum flow 
should be associated with this energy flow. If this were correct, 
then, when light is reflected at the surface of a body, since the 
direction of energy and momentum flow is changed, there should 
be a transfer of momentum from the light to the reflector. The 
existence of this radiation pressure was first shown by Lebedew in 
1900, before the development of the theory of special relativity. 

In 1900 Planck introduced the idea that radiation is emitted in 
the form of discrete bundles or quanta of energy. According to 
Planck, if the frequency of the radiation is v, then the energy of each 
quantum is equal to hv, where h is called Planck’s constant and is 
equal to 6-625 X 10 -34 J.sec. Such a quantum is generally called 
a photon, though if its energy is very high it is sometimes referred 
to as a y-ray. Einstein used the photon concept to interpret the 
photoelectric effect and suggested that if light of frequency v is 
incident on the surface of a substance, individual photoelectrons 
are emitted from the surface when single photons are absorbed. 
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According to Einstein, the kinetic energy of the electrons is equal to 
hv — cf>, where <f> is the work function. 

The existence of individual photons travelling with the velocity 
of light is now an accepted experimental fact. For a particle of 
rest mass m 0 one has 

m 0 _ w 0 u g m 0 c 2 

Vl — u 2 jc 2 ’ ** V1 — h 2 /c 2 ’ Vl — u 2 jc 2 

As u tends to c, the denominator in each case tends to zero, but if 
the rest mass also tends to zero, then each of the above quantities 
can remain finite. If m 0 —» 0 when u —*■ c such that m 0 /(l — « 2 /c 2 )* 
equals k, then 

m = k; p = kc; E = kc 2 

Now for a photon, E equals hv so that k must be equal to hv/c 2 . 
Hence, according to the theory of special relativity a photon of 
energy hv has a linear momentum hv/c and an inertial mass hv/c 2 
associated with it. It is sometimes said that a photon has zero rest 
mass. Since photons travel with the velocity of light, it is impossible 
to find an inertial reference frame in which the photons are at rest, 
so that the term rest mass is not strictly applicable to photons. 

The above theory was applied successfully by Compton to inter¬ 
pret what is now known as the Compton effect. Compton showed 
experimentally that when a monochromatic beam of x-rays is 
scattered by a light element such as carbon, the scattered radiation 
consists of two components, one having the same wavelength as the 
incident radiation and the other having a slightly longer wavelength. 
In order to account for the presence of the longer wavelength 
Compton suggested that the scattering process could be treated as 
an elastic collision between a single photon and a free electron. Let 
a photon of energy hv and momentum hv/c, moving along the 
positive x direction, be incident upon a stationary electron of rest 
mass m 0 in an inertial frame E as shown in Figure 5.13. Choose the 
direction of the y axis such that the scattered photon and recoiling 
electron are in the xy plane after the collision. Let the photon have 
energy hv', and move off at an angle 6 to the x axis after collision 
and let the electron move off at an angle <f> with velocity u as shown 
in Figure 5.13. According to the law of conservation of energy: 

hv + m Q c 2 = hv' + me 2 (5.101) 

where m = m 0 /( 1 — m 2 /c 2 )* is the relativistic mass of the electron 
after collision. Since c — h> = AV, eqn (5.101) can be rewritten as 


me = j — j} + rritf 
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/♦ 



Figure 5.13. The Compton effect. A photon of energy hv and 
momentum hvjc is incident on a stationary electron of rest mass 
m 0 . After collision a photon has energy hv' {which is less than 
hv) and momentum hv'jc. The electron recoils with velocity u 


Squaring, 


that is 


A 2 h 2 2h 2 „ (h h\ , 
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A 2 A 2 2A 2 
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From the law of conservation of momentum, 
hv hv' 

— _ — cos 0 4 -mu cos i 
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hv' 


Rearranging, 


0 = — sin 6 — mu sin <A 
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mu cos 9 = 2 — ~X cos " 
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Squaring and adding, 
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i 

that is, 
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Hence m 2 u 2 = (m 2 — m 2 )c 2 , so that the right-hand sides of eqns 
(5.102) and (5.103) can be equated, giving 


Hence, 


2 h 2 n 

IT + 2m ° c 



2 h 2 

XI 


cos 9 


AA 



— cos 6 ) 


(5.104) 


Substituting numerical values for h, m 0 and c one obtains 

AA = 0-0242(1 - cos 9) A (5.105) 

Eqn (5.105) states that when a photon is scattered through an angle 
9 by a free electron, the wavelength A' of the scattered photon should 
be greater than the wavelength of the incident photon by an amount 
0-0242(1 — cos 9) A. The predicted dependence of AA on 9 was 
verified by experiment. The kinetic energies of the recoil electrons 
have also been measured and found to be in agreement with the 
calculated values. Thus, by attributing momentum hvfc to a 
photon of energy hv, Compton was able to account for the Compton 
effect. The calculation of the cross-section for the scattering process 
had to await the development of relativistic quantum mechanics. 

It was shown in Section 4.4 that, using the wave theory of light, 
one could account for a large range of optical phenomena including 
the aberration of the light from stars and the Doppler effect. These 
phenomena will now be discussed using the photon model of light. 
Let a photon of energy hv be emitted from a star at rest in an inertial 
frame 27. Let the direction of the photon be towards the origin as 
shown in Figure 5.14(a). The momentum of the photon measured 
in 27 has components 


p x = —( hv/c ) cos «; p y = —( hvjc) sin a; p t = 0 


It will be assumed that the momentum and energy of a photon 
transform like a single particle. In an inertial frame 27' moving with 
velocity v relative to 27 as shown in Figure 5.14(b), using the equation 



one obtains 


(5.54) 


, / hv hv\ hv 

Px = rl — — cos a — v I = — y — (cos a + vjc) 
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From eqn (5.55), 


Pv 


tan oq 


hv . 

— Pv~ —— sm a 

p' v sin aV 1 — v 2 [c 2 

~ p' x ~~ cos a + vjc 


(5.106) 


This formula is in agreement with eqns (4.24) and (4.33), which 
were derived in Section 4.4, using the wave model of light; eqn 
(5.106) represents the formula for aberration under the same 
conditions as considered in Section 4.4.2. 



rCThe geocentric system moves 
with velocity y-*relative to 2) 



v T he star 
moves in Z* 


The direction in which 
the telescope is 
pointed in Z f 


The earth is at rest in Z 1 


(a) ( b ) 

Figure 5.14. A photon is emitted from a star, (a) The photon has energy hv and momentum 
hvjc in the inertial frame H, in which the star is at rest. ( b ) The energy and momentum in 
the geocentric system can be calculated using the momentum and energy transformations of 
the theory of special relativity. This gives the correct formula for the aberration of the light 

from the star 


In order to treat the Doppler effect, the same conditions as used 
in Section 4.4.4, and illustrated in Figure 4.4 are considered. Let a 
source at rest at the origin of E' emit a photon at a time t' = t — 0 
when the origins of E and E' coincide. Let the frequency of the 
photon measured in E' be v', and let the frequency of the same 
photon, measured in E, be v. Let the photon make an angle d 
to the x axis in E so that its momentum has components p x = 
{hvjc) cos 6, p v = (hvjc ) sin 6; p z = 0, in E. From eqn (5.57), 

E' = y{E - vp x ) 

that is, / hv \ 

hv' = yl hv — v — cos dj 
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which reduces to _ 

t’Vl - v 2 /c 2 

(1 — ( v/c ) cos 6) 


(5.107) 


This is in agreement with eqn (4.36). Thus the Doppler effect and 
the aberration of the light from stars can be accounted for satis¬ 
factorily on the photon model of light in terms of the different 
measures of the energies and momenta of photons in Z and Z'. 

In addition to the photon, it is now believed that the neutrino 
and antineutrino also have ‘zero rest mass’ but unlike the photon 
they have spin |(A/ 2tt). If neutrinos have ‘zero rest mass’ then they 
must travel with the velocity of light. 


5.8.4. Pair Production and Positron Annihilation 

It would be consistent with the theory of special relativity to 
extend the equivalence of mass and energy to the rest masses of 
particles and write 

E = m 0 c 2 

According to this relation an amount of energy equal to m 0 c 2 
would have to be expended in creating a particle of rest mass m 0 , 
and if a particle of finite rest mass were annihilated then an amount 
of energy equal to m 0 c 2 would be released. After the discovery of the 
positron, pair production was discovered. If a photon has energy 
hv > 2 m 0 c 2 , where m 0 is the rest mass of the electron, then the photon 
can produce an electron pair. The process does not take place 
spontaneously in a vacuum but it appears that the presence of an 
electric charge, such as the charge on an atomic nucleus or an 
electron, is necessary before pair production can take place. With¬ 
out the presence of the charged particle both energy and momentum 
could not be conserved in pair production. Experiments show that 
the law of conservation of linear momentum and energy do hold 
in pair production. Some of the radiant energy is transformed into 
the rest masses of the positron and the electron. If the energy of the 
photon exceeds 2m 0 c 2 , then the excess energy is distributed between 
the electron and positron produced and the recoiling electric charge. 

When a positron slows down it is attracted to a negative electron 
and forms positronium. The electron and positron move in orbits 
around their centre of mass in a way similar to the hydrogen atom, 
except that the masses of the two particles are equal in this case. 
In the lowest Bohr orbit the quantum numbers n and l have the 
values 1 and 0 respectively. If the spin of the electron and the 
positron are opposite in direction the singlet X S state is formed; if 
the spins are parallel then the triplet *S state is formed. After an 
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average lifetime of ~10 -10 sec in the kS state the positron and 
electron annihilate each other according to the reaction 

e+ + e_ = 2 hv 

Since two photons are emitted, it is possible to conserve both energy 
and linear momentum. The energy of each photon should be equal 
to m 0 c 2 = 0-511 MeV; this is confirmed by experiment. In the 
triplet 3 S state, three photons are emitted, the lifetime being 
—1-5 X 10- 7 sec. Since three photons are emitted, spin and angular 
momentum can be conserved since the spin of the photon is one 
unit. If linear momentum is to be conserved, the three photons 
must be coplanar; this is confirmed by experiment. In this case 
the energy is distributed statistically between the three photons. 
Positron an ni hila tion is an example of the conversion of rest-mass 
energy into radiant energy. 

5.8.5. Particle Decays 

When unstable fundamental particles decay at rest or in flight, 
the dynamics of the decay schemes can be interpreted in terms of the 
laws of conservation of linear momentum and conservation of mass- 
energy. As an example, the decay of a charged 7r-meson when it 
is at rest is considered. 

When a charged w-meson decays, it generally decays into a 
jM-meson and one neutral particle which is now believed to be a 
neutrino. In the inertial frame S' in which the 7r-meson is at rest 
when it decays (the c.m.-system), if linear momentum is conserved, 
then the /i-meson and the neutrino must be emitted in opposite 
directions as shown in Figure 5.15{b), In S' 

&=p:=p' ( 5 - 108 ) 

Some of the rest mass energy of the 7r-meson is transformed into 
the rest masses of the ^it-meson and neutrino (if it has any rest mass), 
the remainder of the energy available appears as the kinetic energies 
of the ft- meson and the neutrino. In S', according to the law of 
conservation of energy, 

= £; + £; (5.109) 

or 

mj 2 = V{(w/ 2 ) 2 + p'*t?} + V{(m/ 2 ) 2 + P?c*} 

where m„ m tl and m y are the rest masses of the ir-meson, the /t-meson 
and the neutrino respectively. 

From eqn (5.108), 

c%* = 
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®~ — r ' - v 
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The IT- meson decays 
In flight in 2 
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Neutrino 
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The TT- meson decays when 
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Figure 5.15. The decay of rr-mesons. In S' the rr-meson is at rest when it decays. The 
fi-meson and the neutrino are emitted in opposite directions in S' as shown in (A). The angle 
a.' is a parameter which varies from decay to decay. In S the rr-meson is moving with velocity v 
when it decays as shown in (a) and the fi-meson and neutrino are not emitted in opposite directions 


so that 

E’/ ~ m V = E * 2 ~ m Y 
or 

(2?; 2 - E?) = (£■; + e:)(e; - K) = - miy 

From eqn (5.109), 

(£; + E£) = my 

Dividing eqn (5.110) by eqn (5.111) one obtains 

E ._ E , 

f* v m 


(5.110) 

(5.111) 

(5.112) 


Adding eqns (5.112) and (5.111), one obtains 

{ml + ml — miy 




2 m_ 


Subtracting eqn (5.112) from eqn (5.111), 

K + < - ml) 


EL = 


2 m_ 


(5.113) 


(5.114) 


Eqns (5.113) and (5.114) give the total energies of the ^-meson and 
the neutrino respectively in the inertial frame 27' in which the 
7T-meson is at rest when it decays. The momenta of the /r-meson 
and the neutrino in 27' are given by the relation 


cp’ = V(E'/ - m 2 y) = V(E' r 2 - my) (5.115) 
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If the 7T-meson decays in flight as shown in Figure 5.15(a), the 
energies and momenta of the ^-meson and neutrino in the lab.- 
system E can be calculated using the transformations for the energy 
and momentum of a single particle derived in Section 5.5 [eqns 
(5.59), (5.60), (5.61) and (5.62)]. The angle a' which the ft -meson 
makes with the x' axis in E' is a parameter which may be different 
in different decays. Alternatively, the laws of conservation of energy 
and of momentum could be applied directly in the lab.-system E, 
in which the ir-meson decays in flight. As the energy of the parent 
77 -meson is increased relative to the laboratory, the /^-mesons are 
emitted more and more in the direction of motion of the parent 
7r-meson in the lab.-system. 

The above analysis holds whenever an unstable particle decays 
into two particles. If M is the rest mass of the decaying particle 
and m 1 and m 2 are the rest masses of the secondary particles, then 
eqns (5.113), (5.114) and (5.115) become 

E[ = (M 2 + m\- m 2 )c 2 l2M (5.116) 

E; = (M 2 + m\ - m\)c 2 l2M (5.117) 

cp' = V(E ' 2 - mY) = V(E ' 2 - m|c 4 ) (5.118) 

Some fundamental particles decay into three particles, for 
example a /x-meson decays into an electron and two neutrinos. 
Sometimes A'-mesons decay into three particles, for example the 
decay scheme of what was originally called a r-meson is 

K+ (or T + ) —► 7T + + 7T+ + 7T~ 

When the t + meson decays in a nuclear emulsion, the tracks of the 
three 7r-mesons are visible, and if they end in the nuclear emulsion 
their energies can be estimated from their ranges in the emulsion. 
It is found that the three 7r-mesons are coplanar, as they should be if 
linear momentum is conserved. It is also found that, though the 
momentum and energy available is distributed statistically between 
the three 7r-mesons, both energy and linear momentum are con¬ 
served in the decays. It is possible to interpret the dynamics of all 
particle decays in terms of the laws of conservation of energy and of 
linear momentum. Other laws, such as the laws of conservation of 
charge, conservation of spin and angular momentum, conservation 
of baryons and conservation of leptons appear to be valid also. In 
three-particle decays energy and momentum are distributed statisti¬ 
cally between the decay particles. Whilst these energies and 
momenta are consistent with the laws of conservation of linear mo¬ 
mentum and conservation of energy, these laws alone say nothing 
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about the probabilities of various energy values. For example, in 
order to estimate the energy distribution of electrons from //-meson 
decay, one must use a precise meson theory. 


Example: A 77 -meson of rest mass 139-6 MeV/c 2 decays at rest into a 
//-meson of rest mass 105-7 MeV// 2 and a neutrino of zero rest mass. 
Calculate the kinetic energy of the //-meson. If the 77 -meson was moving 
with a velocity of 0-8/ relative to the laboratory when it decayed, calculate 
the maximum and minimum possible values for the kinetic energy of the 
//-meson relative to the laboratory. 

Substituting for m r c 2 , my and m v c 2 into eqn (5.113), 




139-62 + 105-7 2 - 0 2 
2 X 139-6 


109-8 MeV 


r; = 109-8 - my = 4-1 MeV 
Similarly for the neutrino. 


E' v = T' v = 29-8 MeV 

Now, 

cp = VE?—my = V 109-8 2 - 105-7 2 == 29-8 MeV 

If the 7r-meson is moving when it decays, the maximum and 
minimum values for the kinetic energy of the //-meson are when 
the //-meson is emitted in the forward and the backward directions 
respectively in the c.m.-system, that is parallel and antiparallel to 
the x axis in Figure 5.15(b). 

Since 

ujc = 0-8, y = 3 

(£„)max = y(K + vp' x ) = f(109-8 + 0-8 X 29-8) 

(£„)max = 223 MeV; (7;) max = 117 MeV 
(-E„) mi „ = #(109-8 - 0-8 X 29-8) = 143 MeV 
(7;)min = 37 MeV 

In the general case when a' is not 0 or 180 degrees in Figure 5.15(b), 
the energy, momentum and direction of the //-meson in S can also 
be calculated using the energy and momentum transformations. 


5.8.6. Experimental Check of the Principle of the Constancy of the Speed 
of Light in Empty Space 

In high energy nuclear interactions, 7r°-mesons are sometimes 
produced. These 7r°-mesons have a rest mass of 135 MeV/r 2 . They 
are unstable and generally decay into two y-rays (photons). In the 
rest frame of the decaying 77°-mesons, the y -rays have energies of 
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67-5 MeV each. The mean lifetime of 7r°-mesons, when they are 
at rest, is ~0-7 X 10~ 16 sec. In the rest frame of the decaying 
7r°-mesons, the speed of the photons arising from 7r°-meson decay 
should be equal to the accepted value for the speed of light emitted 
by a stationary source, namely c = 2-9979 X 10 8 m/sec. If the 
7 r°-mesons are moving relative to the laboratory when they decay, 
according to the principle of the constancy of the speed of light in 
empty space, the speed of the y-rays should still be equal to c 
relative to the laboratory whatever the speed of the source of the 
y-rays, (7r°-mesons in this case). Alvager, Farley, Kjellman and 
Wallin 13 (1964) measured the velocity of y-rays from 7r°-mesons 
of energy >6 GeV, corresponding to (1 — m 2 /c 2 )~* > 45. The 
velocity of the photons was measured by timing the y-rays over a 
known distance. 

The experiment was performed with the CERN Proton Syn¬ 
chrotron, using protons of momenta 19-2 GeV/c. The circulating 
proton beam consisted of bunches of protons a few nano-sec 
(10~ 9 sec) long. The 7r°-mesons were produced in a beryllium 
target in the nuclear disintegrations produced by the 19-2 GeV/c 
protons. The y-rays arising from 7r°-meson decays were observed 
at an angle of ~6 degrees to the proton direction. Bending 
magnets were used to sweep away charged particles. The ir°-mesons 
decayed quickly (<1 cm) into y-rays, so that each bunch of protons 
gave a bunch of y-rays arising from 7r°-meson decay. The y-rays 
were detected by means of a 4 mm lead absorber (which converted 
some of the y-rays in each bunch into electron pairs), followed by a 
small plastic scintillator in coincidence with a lead-glass Cerenkov 
counter to detect the electron pairs from y-rays of energy >6 GeV. 
Two such detectors were used at a distance of (31-450 ± 0-0015) m 
apart. Some of the y-rays in each bunch were converted into 
electron pairs in the first detector and some of the y-rays in the 
same bunch were converted into electron pairs in the second 
detector. The time interval between the two pulses was measured 
electrically. The time of flight (■~10~ 7 sec) between the two 
detectors was substantially longer than the length of each pulse of 
y-rays. From the time of flight, the velocity of the y-rays was 
determined. The result for the velocity of y-rays of energy >6 GeV 
from a source moving with a velocity of 0-99975 c (calculated 
using the equations of the theory of special relativity) was 
(2-9977 ± 0-0004) X 10 8 m/sec. This was the same, within experi¬ 
mental error, as the accepted value of 2-9979 X 10 8 m/sec for the 
velocity of light from a stationary source. A similar experiment 
using quanta arising from positron annihilation was carried out by 
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Sadeh 14 (1963). This experiment also confirmed the principle of 
the constancy of the speed of light. 

It was shown in Section 5.4.2 that accelerated electrons tend to 
a limiting speed, namely the speed of light. If all inertial frames 
are equivalent, and if the units of length and time are the same in 
all of them, then the limiting speed should be the same in all 
inertial frames. Otherwise, it would be possible to differentiate 
between the various inertial frames in terms of the different values 
for the limiting speed. This result was called the principle of the 
constancy of the limiting speed for particles. Photons, or light 
quanta such as y-rays, can be considered as particles of ‘zero rest 
mass’ travelling at the limiting speed (cf. Section 5.8.3). According 
to the principle of the constancy of the limiting speed of particles, 
this limiting speed is the same and equal to c in all inertial frames 
(cf. Section 5.4.2). This, in the case of photons, becomes the 
principle of the constancy of the speed of light. 

5.8.7. Meson Production 

When fast protons or neutrons collide with atomic nuclei, 77- 
mesons can be produced, provided the energy of the incident 
particle is high enough. The charged 7r-mesons produced have 
masses of 273-2 electron masses and the neutral 77 -mesons have 
masses of 264-4 electron masses. In these collisions some of the 
kinetic energy of the incident particle is converted into the rest mass 
energies of the 7r-mesons. At higher energies AT-mesons, nucleons, 
negative protons, antineutrons and hyperons may be produced. 
Mesons can also be produced by high energy photons. Experi¬ 
ments show that the laws of conservation of linear momentum and 
energy are valid in these collisions. A reader interested in a full 
discussion of the dynamics and interpretation of such collisions is 
referred to Blaton 15 , Dedrick 16 and Powell, Fowler and Perkins 17 . 
We shall confine our remarks to a brief discussion of some of the 
more important points. 

It is not possible to transform all the kinetic energy of the colliding 
particles into some other form of energy (except when the lab.- 
system is the c.m.-system, for example, when two particles of equal 
rest masses approach each other with equal speeds). If the total 
momentum is not zero in the laboratory system before the collision, 
then, if momentum is conserved in the collision, there must still be 
the same total momentum present relative to the laboratory after 
the collision, so that some, at least, of the particles present after the 
collision must be moving and have kinetic energy relative to the 
laboratory. However, in the c.m.-system the total momentum 
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before the collision is zero, so that it is possible, in principle, for all 
the particles to be at rest relative to the c.m.-system after collision 
and still conserve momentum. In this case, all the kinetic energy 
in the c.m.-system would have been converted into some other form 
of energy by the collision, for example into the rest mass energy of 
a created particle, or into heat. 

The collision of two particles will be considered. Let an incident 
particle of rest mass m x , momentum p x and total energy E x collide 
with a stationary particle of rest mass m 2 . It was shown in Section 
5.5 that the sum of the momenta of the two particles before collision 
transforms in the same way as the momentum of a single particle. 
Using eqn (5.66), if is parallel to the x axis, 

P'x = yiP - vE l<?) 

where, before collision p = p x and E = (mfc 4 + c 2 /> 2 )* + m 2 c 2 . In 
the c.m.-system p' = 0, hence 


n .=P‘ 2 IE- 


PlC* 


PiC* 


(E x + m 2 c 2 ) V (pfc 2 + m\c A ) -f m.f 1 


(5.119) 


If the momentum p and energy E are conserved in the collision in 
all inertial reference frames, the c.m.-system must continue to move 
with velocity u c . m . relative to the lab.-system after the collision. 

It is convenient to discuss the inelastic collisions of atomic particles 
in terms of the energy available in the c.m.-system. From eqn 
(5.68), we have for the sum of the momenta and the sum of the total 
energies of the colliding particles in E and E' before collision 

pi - £ 2 /c 2 = p _ (5 . 68) 

c 


Now if the struck particle is at rest in the lab.-system before collision 
and the moving particle has total energy E x , one has E — E x + m 2 c 2 . 
If E' is the c.m.-system, then p' = 0. Hence eqn (5.68) becomes 

E ' 2 = {E x + m 2 c 2 ) 2 - p 2 c 2 (5.120) 

For particle 1 before collision 

p\c 2 = E\ — my 

The momentum of particle 2 is zero before the collision. Substi¬ 
tuting in eqn (5.120), and taking the square root, 

E' = (2m 2 c 2 E 1 + my + m 2 y)i (5.121) 

This is the total energy available in the c.m.-system. It includes 
the rest mass energies of the colliding particles. In the extreme 
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relativistic case u x ^ c, rn^c 2 < E 1 and m 2 c 2 < E v then the total 
energy E' available in the c.m.-system is ^(2m 2 c 2 E 1 )i. Thus at 
extremely high energies, the energy available in the c.m.-system 
increases approximately as the square root of E v the energy of the 
incident particle. For a proton of energy 30 GeV incident on a 
stationary nucleon (i.e. proton or neutron) there is about 7 GeV 
total energy available in the c.m.-system. For a 300 GeV proton 
the energy available in the c.m.-system is —22 GeV. 

Some of the total energy available in the c.m.-system goes into 
the rest masses of the particles present after the collision. These 
particles may be the same as or different from the colliding particles. 


Before collision After collision 



Figure 5.16. Two particles of rest masses m, and m 2 collide. 
The increase in rest mass after the collision is equal to Am 


Sometimes new particles are produced. For example, let the 
reaction be as shown in Figure 5.16. Let A m be the sum of the rest 
masses of the new particles. To find the minimum energy necessary 
to produce the new particles, it will be assumed that all the particles 
are at rest in the c.m.-system after the collision. From eqn (5.121) 
one has for E' the total energy available in the c.m.-system 

E' 2 = 2 m 2 c 2 E 1 + m\fi -f m|<r 4 (5.122) 

After collision, if all the kinetic energy is zero in the c.m.-system, 
then 

E' 2 = {mf 2 + tn 2 c 2 (A m)c 2 } 2 (5.123) 

Equating the right-hand sides of eqns (5.122) and (5.123), one 
finds for 7\ = E 1 — m x c 2 

T '-<M' + 2 + £) (5.124) 

This is the kinetic energy necessary in the lab.-system to produce 
extra particles of total rest mass A m, if the rest masses of the colliding 
particles are unchanged. As an example consider the reaction 

V + P P + *r° 
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in which an incident photon produces a 7r°-meson in a collision with 
a stationary proton. Using p = 1836m,,, v° = 264-4 m e , one finds 
T x is equal to 145 MeV. Thus the energy of the incident photon 
must exceed 145 MeV if a -iT°-meson is to be produced in a collision 
with a proton. 

As another example consider the reaction 

p + p-*p + n + 7r+ 

Neglecting the difference between the neutron and proton rest 
masses, and taking the rest mass of the 7r-meson as 273-2 m e , we 
find 7\ = 290 MeV. Thus the incident proton must have a kinetic 
energy exceeding 290 MeV to produce a 7r+-meson in a proton- 
proton collision. It is interesting to note that the photon threshold 
energy is substantially less than the proton threshold energy for 
ir-meson production. The production of a proton-antiproton pair 
in a proton-proton collision according to the reaction 

p+p=P+P+P+P 

requires a proton kinetic energy of at least 5-6 GeV in the lab.- 
system. If the incident kinetic energy exceeds the threshold kinetic 
energy, the excess energy is distributed among the particles present 
after the collision. 

5.9. TURNING LIGHT INTO GAMMA RAYS 

It is sometimes convenient to consider phenomena from a different 
reference frame from the laboratory. In this new reference frame, 
the experimental conditions are different and may correspond to 
well known phenomena which have been studied extensively in the 
laboratory system. As an example of this approach, we shall 
consider an experiment carried out by Kulikov and colleagues 18 . 
In this experiment, red light of wavelength 6943 A from a ruby 
laser collided head on with 500 MeV electrons in an electron 
synchrotron as shown in Figure 5.17(a). Using E = hv, where h 
is Planck’s constant and v is the frequency of the light, it follows 
that the energies of the incident photons are about 2 eV in the 
lab.-system. 

Now consider the collision from the reference frame E' shown in 
Figure 5.17(b), in which the electrons are at rest before the collision. 
This reference frame moves with velocity v relative to E, where v 
is the speed of the electrons relative to the laboratory. From eqn 
(5.57) 

E' — y(E — vp x ) (5.125) 
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where E is the energy and p is the momentum of the photon 
relative to the laboratory and y = (1 - V 2 jc 2 )-K For the incident 
photon, let E == kv = 2 eV; p x ------ —hvlc; p x is negative since the 

photon moves in the negative * direction in Figure 5.17(a) Sub¬ 
stituting in eqn (5.125) 

E' = y{hv + ^ = 2y^l + eV (5.126) 

Now the energy of the electron relative to the laboratory is 
ym 0 c ~ 500 MeV. Since for an electron, m 0 c 2 ~ 0-5 Me V, y is 


2 (Laboratory system) 

BEFORE COLLISION 
| 50 0 Mev Red light from 

electron hv~2eV laser 

AFTER COLLISION 
e^fy-AMeV 




Electron 


(o) 


H’fMoves-—v relative to £ 
y' i Electron at rest before 
1 collision) 

BEFORE COLLISION 
Electron ^^^E'=hv'~hke\/ 
at rest* p\--hv’!c 

after collision 


0 ' 



E'-hu'-hkeV 


Electron 


(b) 


Figure 5.17. (a) Red light from a ruby laser collides with 500 MeV electrons. The photons 
present after collision have energy m the MeV range, (b) The electrons are at rest in Z’ 
before collision. The photons from the laser have energies in the keV range in Z' The 
normal formulae for the Compton effect are valid in Z' 


'"■'1,000 and v^c. Substituting in eqn (5.126), we find E' ~ 4,000 
eV or ~4 keV. Thus in the reference frame in which the electrons 
are at rest before the collision, the incident photons from the laser 
have energies ~4 keV, which is a typical value for X-rays. Since 
the electron is at rest in 17, the normal formulae for the Compton 
effect should hold. Applying conservation of energy and linear 
momentum, it was shown in Section 5.8.3 that 

= — (i — cos a) = 0-0242(1 — cos a) A 

where A/T is the change in wavelength, and a is the angle of 
scattering in 17. For example, if a = tt/ 2, in Z\ AE is ~0-0242 A. 
This is small compared with the wavelength of a 4 keV photonj 
which is ~3-l A. Hence the changes in wavelength and frequency 
are small, so that the change in the energy of the photon is small. 
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The reader can verify this by putting a = tt/ 2 in eqn (5.90) and 
showing that the energy of the photon present after a collision 
with a stationary electron of rest mass m 0 is E'ftl + E jm^) == 
£'/(l + 4/500). Thus in E', we have photons of energy ~4 keV 
scattered in various directions after collision with stationary 
electrons, in accordance with the theory of the Compton effect. 
As a typical example take a = 90 degrees. For the photon present 
after collision, in E' we then have, E' = 4 keV, p x = 0,p v = E jc. 
Applying the inverse transformation to transform back to the lab.- 
system, we have 

E = y(E' + vp' x ) = 1,000£' 4 MeV 

Applying the momentum transformations 

Pv — Py~ E'/c 

Px = y(Px + vE'/c 2 ) = yvE'jc 2 ^ yE'jc 

Hence in Figure 5.17(a), after collision 

tan 6 — 6 = pjp x = 1/y = 10“ s radians 

Similar calculations can be applied for other values of scattering 
angle « in the rest frame of the incident electrons. Thus relative 
to the laboratory, the scattered photons have energies in the MeV 
(or y- ray) region. They are scattered into a narrow beam, opposite 
in direction to the direction of the incident light but in the direction 
of the incident electrons. Thus the energetics of the process dis¬ 
covered by Kulikov and colleagues 18 can be illustrated in terms 
of the Compton effect using the energy and momentum trans¬ 
formations of the theory of relativity. 

It is interesting to note that this process may occasionally give 
rise to X-rays and y-rays in stars. Fast electrons may sometimes be 
produced, e.g. in stars such as the Crab Nebula, and possibly in 
sunspots. If sufficiently fast electrons collide with visible radiation, 
X-rays can be produced by the above process. 

5.10. ROCKET MOTION 

A rocket is initially at rest relative to the inertial reference frame E. 
It is assumed that there are no external forces acting on the rocket. 
The rocket propels itself rectilinearly by emitting gases in the 
backward direction. Let the gases be emitted at a constant speed w 
relative to the rocket. Consider the instant when the rocket is 
moving with a velocity v parallel to the x axis of E as shown in 
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Before emitting gas hm Q 


m n 




After emitting gas hm g 
om n m 


m Q + 5 m Q 


(a) 


/ 


0' 


I' (Moves ——v relative to I) 
Before emitting gas bm g 

^ 1 ^ > Rocket is 

/77_ instantaneously 
0 at rest 


After emitting gas 5 m g 
m Q + 6 m 0 


W lP 

orr>g 


tb) 


-x' 


Figure 5.18. (a) The rocket is moving with velocity v relative to X. (b) The rocket is 

, r7l!i‘ti, t , an / 0 r^ if rest , rel f we t0 Ej After emitting gases of rest mass 8m, at a speed w 
relative to X , the rocket has a speed Sv relative to X', its rest mass changes to m, + 8m 0 
1 he speed of the rocket , relative to 27, changes to v -}- Sv 


Figure 5.18(a). At this instant, the rocket is instantaneously at rest 
in the inertial frame S', shown in Figure 5.18(b). Let m 0 be the rest 
mass of the rocket at this instant. Let the increase in the speed of 
the rocket relative to S' be Sv' when gases of rest mass Sm„ are 
emitted with a speed w relative to S'. Let the rest mass of the rocket 
change from m 0 to m 0 + 8m 0 . Application of the law of conservation 
of mass-energy relative to S' gives 

~ .2 . Oo + Sm 0 )c 2 dm„c 2 

■— ‘ r H- - 

Vl — (8v') 2 Jc 2 V1 — w 2 jc 2 
Neglecting terms of order (Sv') 2 gives 


~dm 0 c 2 = 


dm g c 2 

Vl~w 2 lc 2 


(5.127) 


The decrease in the rest mass-energy of the rocket dm 0 c 2 is not equal 
to Sm g c 2 , the rest mass-energy of the emitted gases, since the gases 
have a velocity w and kinetic energy relative to S', 

Application of the law of conservation of linear momentum 
relative to S' gives 

n = _ bm « w ( m o + <K) to' 

Vl - w 2 /c 2 V1 - (dv'Yjc 2 


Neglecting terms of order 8m 0 8v' and (Sv') 2 , we have 


Sm g w 

Vl — w 2 jc 2 
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Substituting for 8m g in eqn (5.127) gives 

— 8m 0 w = m 0 8v' (5.129) 


Using the velocity transformations, eqn (4.9), since relative to 27' 
the speed of the rocket is u' x = 8v, we obtain for the velocity of the 
rocket relative to 27, 


(v + 8v) 


(8v r + v) 

(1 + vdv'/c 2 ) 

8v =£= dz/(l 


^ (v + 8v 
— c 2 /c 2 ) 




Substituting for 8v’ into eqn (5.129) gives 


Rearranging, 


m 0 8v 

—8m a w = ) 

w 8m 0 8v 

m^~ ” (1 - v 2 /c 2 ) 


(5.130) 


Alternatively, the reader can apply conservation of mass-energy 
and conservation of momentum in the laboratory system. (Ref¬ 
erence: Halfman 19 .) Integrating, 


—zi>[ln 


L2 ln 



U 


0 


where and m f are the initial rest mass and the final rest mass of 
the rocket respectively, and u is the final velocity of the rocket 
relative to 27. Taking exponentials 

trh _ l c + u 
m f \c —■ u 

With present day rockets, w the exhaust speed of the gases is of the 
order of 10 km/sec so that c/2w = 3 X 10 8 /2 X 10 4 = T5 X 10 4 . 
If the final speed of the rocket is to be say c/2, then 



If the final mass of the rocket is to be large enough to hold an 
astronaut, say 1 ton, one would have to start with a rocket weighing 
about 10 7,157 tons. It can be seen that, with present day rocketry 
techniques, it would be impossible to accelerate a rocket to a speed 
comparable with the speed of light. One can accelerate atomic 
particles to such speeds due to their small rest mass. In these cases 
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the energy is applied by external methods using electric and mag¬ 
netic fields. The discussion of what may or may not happen to 
astronauts travelling with speeds comparable to the speed of light 
is largely of academic interest with present day gas rockets. (If 
however, one could use photons (w = c ) or ions with (w c ) then 
for a = c/2, mjm f would only be V3.) 

The acceleration of the rocket and the distance it will travel in a 
given time relative to E depend on the rate at which the exhaust 
gases are emitted. One would have the most comfortable journey, 
if the rate of emission of gases were controlled such that the accel¬ 
eration of the rocket relative to the inertial frame, in which it is 
instantaneously at rest, were always equal to g, the terrestrial value 
of the acceleration due to gravity (g .—' 9-81 m/sec a ). It will be 
shown in Chapter 9 that the conditions inside the rocket would 
then be similar to conditions on the earth. Proceeding as in 
Problem 4.10, we would obtain ° 



for the velocity and the distance travelled relative to E after a time 
t. This is an example of hyperbolic motion. In one year the rocket 
would travel about 0-43 light years and have a final speed of about 
0-75c relative to the earth. The time for the journey, relative to 
the rocket, can be calculated using the clock hypothesis introduced 
in Chapter 8 (cf. eqn (8.8) and Problems (8.8) and (8.9)). 

5.11. A REVIEW OF RELATIVISTIC MECHANICS 

The theory developed in this Chapter was not deduced from the 
principle of relativity and the principle of the constancy of the 
velocity of light. The approach adopted was to try to modify 
the laws of Newtonian mechanics, such that they obeyed the 
principle of relativity when the co-ordinates, times and velocities 
were transformed according to the transformations of the theory of 
special relativity. This necessitated a redefinition of mass, such that 
the mass of a body depended on its velocity, and this in turn 
necessitated a redefinition of momentum, force, energy, etc. The 
resulting theory proved to be in excellent agreement with the 
experimental results. The dynamics of a single particle was based 
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on the equation 

d / m 0 u \ B (5 . 131) 

dt VVl - u 2 /cV 

It would be more convenient for somebody interested only in 
electron optics to postulate eqn (5.131) directly, rather than to 
start from the principle of relativity and the principle of the 
constancy of the velocity of light. Such an approach avoids a 
discussion of length contraction, time dilation, etc., since eqn 
(5.131) can be applied directly in the laboratory system and it is 
not necessary to discuss the transformation of co-ordinates and time 
from one reference frame to another. Such an approach has much 
to recommend it in an introductory course on atomic physics. The 
confirmation of the postulate [eqn (5.131)] would be, that pre¬ 
dictions based on eqn (5.131) are in excellent agreement with the 
experimental results (cf. Appendix 4). , 

The consideration of collisions led to the idea of the equivalence 
of inertial mass and energy. In Section 5.8 a survey was given of 
various processes in which energy was transformed from one form 
to another. It was shown that changes in inertial mass were 
associated with changes in all forms of energy. These results can be 
summarized as the law of the equivalence of inertial mass and 
energy. Energy and inertial mass are generally interpreted as 
different properties of matter, for example, the energy of a particle 
is generally interpreted as a measure of its capacity to do work, 
whereas the inertial mass of a particle is a measure of the resistance 
of the particle to changes in its motion. The law of the equivalence 
of mass and energy does not imply that mass is sometimes converted 
into energy, or vice versa, but states that the changes in one are 
accompanied by corresponding changes in the other, inertial mass 
and energy being proportional to each other. The law of con¬ 
servation of mass is therefore equivalent to the law of conservation 
of energy and they are sometimes combined together and called the 

law of conservation of mass-energy. 

In Section 5.6 it was postulated that both linear momentum and 
total energy were conserved in ‘point’ events. It was illustrated how 
these conservation laws led to predictions which were in agreement 
with the experimental results. It was possible to calculate relation¬ 
ships between the momenta and energies of the particles in a collision 
or decay process. It should be remembered that the laws of con¬ 
servation of momentum and energy do not give a complete theory 
of these processes. As an example, consider the Compton effect 
which was discussed in Section 5.8.3. The theory developed gave 
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the variation of frequency (and energy) of the scattered photon with 
angle, but did not give the probability that a collision took place 
neither did it give the relative probabilities of various scattering 
angles. In order to obtain a fuller theory extra hypotheses would 
have to be introduced, and the full theory of the Compton effect 
had to await the development of relativistic quantum mechanics 
and its application to the Compton effect by Klein and Nishina 20 . 
However, since the new theory is consistent with the theory of 
special relativity, the dynamical relationships between the particles 
in a collision calculated using the new theory must be the same as 
that calculated using the techniques developed in Sections 5.7 and 
o.o, subject to the limitations of the uncertainty principle. 

Total energy and linear momentum were conserved in all the 
phenomena described in Sections 5.7 and 5.8. It appears that 
phenomena, such as pair production, do not take place unless 
there is a charged particle present so that energy and momentum 
can be conserved. It should be emphasized that all the phenomena 
considered were events taking place at approximately one point of 
space at one instant of time. When two particles such as two electric 
charges, separated in space at points and * 2 in an inertial frame 
, give rise to electromagnetic forces on each other which lead to 
changes m their momenta, if momentum is conserved in one inertial 
frame 2, then, if the momenta of the individual charges are trans¬ 
formed to 27, the transformed values of momenta refer to times 
1 ~ V' 1 7 Dx il c ) and t' a = y(t — vx 2 /c 2 ) respectively, and so do not 
refer to the same time in 27'. Since the momenta are changing 
continuously it is not possible to formulate a law of conservation 
of momentum for spatially separated interacting charged particles 
which is valid m all inertial frames, unless one attributes momentum 
to the electromagnetic field. The electromagnetic forces between 
separated moving charges is discussed in detail by Rosser 3 , Section 
7.5. Similar arguments can be applied to colliding particles of finite 
extension. There are, however, a very wide range of phenomena 
which can be considered, to a good approximation, as events taking 
place at one point of space at one instant of time and experiments 
have shown that in these events the law of conservation of linear 
momentum and the law of conservation of mass-energy are valid. 

It must be remembered that the last statement must be qualified 
to the extent of the uncertainty principle. 

After discussing the mechanics of‘point’ particles, it is normal in 
Newtonian mechanics to extend the theory to rigid bodies of finite 
dimensions, the mechanics of continuous medii, hydrodynamics, etc. 
As fluids and large bodies almost invariably move with velocities 
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very much smaller than the velocity of light, the deviations from 
Newtonian mechanics are not important in practice. Furthermore, 
it is more convenient to develop these topics using tensor methods. 
For these reasons these topics will not be considered here, but the 
interested reader is referred to the books by Meller, Tolman, Pau 1 , 
Fock, etc. The present discussion will be confined to a few comments 

about ‘rigid’ bodies. . , , , 

In Newtonian mechanics, a rigid body is defined to be a body 
in which the distance between any two particles making up the body 
is an invariant. If one of these two particles is acted on by a force, 
which has a component along the line joining the particles, such 
that one of the particles is moved, then according to Newtonian 
mechanics, this force must produce instantaneously a similar change 
in the position of the second particle, so that the separation of the 
two particles remains the same; otherwise the separation of the two 
particles would change and the body would not be a rigid body 
according to the classical definition. Therefore, a rigid body, in 
the classical sense, should be able to transmit forces between its 
constituent parts with infinite speed. According to the theory ol 
special relativity, energy and momentum (and hence forces) cannot 
be transmitted with a velocity exceeding the velocity of light, so 
that according to the theory of special relativity there are no rigid 
bodies in the strict classical sense, and the concept of a rigid body is 
rarely introduced into the theory of special relativity. Sometimes 
the concept of a rigid body is introduced for purposes of discussion, 
for example, to quote McCrea: 21 

We shall therefore now define a rigid rod as one along which impulses are 

transmitted with speed c. . 

Since our rod is being treated as a one-dimensional body and since we are 

dealing only with longitudinal motion, no other condition is required. 

Moreover, since the theory permits the existence of no more rigid hoc y 

of this sort, there is no objection to adopting the term rigid in this sense. 

At first sight it appears that this new definition does away with 
the normal ideas of everyday solid bodies such as a steel rod. 
Consider the propagation of longitudinal waves in such a rod. Let 
the rod be uniform, and let it be composed of a material of proper 
mass m 0 per unit length, and let the modulus of elasticity be A. 
To quote McCrea again: 

According to both classical and relativistic mechanics the speed of 

longitudinal waves of small amplitude is then v where 

v 2 = A/m 0 
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Since there is no dispersion, this is also the speed at which impulses of 
small amplitude traverse the rod. We have seen that relativity theory 
requires v < c and hence we must have 

X < m 0 <? 

Thus relativity theory imposes an upper limit m Q c 2 upon the modulus X. 

In practice X/m 0 (which is equal to the ratio of Young’s modulus to 
the density) is about 3 x 10 11 c.g.s. units for steel, whilst c 2 is 
9 x 10 20 c.g.s. units, so that the relativistic upper limit is not of any 
practical significance, even for the most rigid substances known. 

A classical rigid body is often used in classical physics as the ideal 
reference frame for measuring co-ordinates. Such a reference frame 
cannot be used in relativity theory since lengths are not invariant. 
On the other hand, according to the theory of special relativity, light 
moves in straight lines with the same speed in all inertial frames, 
so that straight lines can be defined in terms of light paths, and 
distances measured using the radar methods described in Section 3.9. 

Generally, quantum mechanical problems have not been con¬ 
sidered in the text. The reader is advised to have a thorough 
grounding in non-relativistic quantum mechanics before proceeding 
to relativistic quantum mechanics. It will be shown in Section 6.5 
that the de Broglie relation X = h/p is consistent with the theory 
of special relativity. Schrodinger’s equation is a non-relativistic 
equation and it can be shown that if the change of potential over 
the width of the wave pulse representing a particle is negligible, 
then Schrodinger’s equation approximates closely to Newton’s laws 
of motion. In order to obtain equations valid at relativistic energies 
one must start from the Hamiltonian given by eqn (5.48). The 
best known of these equations is the Dirac equation, which has been 
shown to be applicable to electrons (of spin J). The interested 
reader is referred to Mandl 22 . 
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PROBLEMS 

Problem 5.1— Show how, from a consideration of the elastic collision of 
two spheres, the theory of Newtonian mechanics must be modified if it is 
to become consistent with the theory of special relativity. 

Problem 5.2— -Discuss, critically, the experimental evidence for the varia¬ 
tion of mass with velocity. 

Problem 5.3— Calculate the velocities of electrons accelerated through 
potential differences of (a) 10,000; ( b ) 100,000; and (c) 1,000,000 V. What 
is the ratio of the relativistic mass to the rest mass in each instance ? (Hint.. 
Use trigonometrical substitutions to calculate m/m 0 from u[c.) 

Problem 5.4— Calculate the velocities of protons of kinetic energies 
(a) 10; ( b ) 100, and (c) 1,000 MeV. 

Problem 5.5— Show that if a particle is highly relativistic (y = (1 - 
„2W)-i ^>1), the fractional difference between c and v is approximately 
i{(m 0 c 2 ) IE} 2 , where E is the total energy. 

Problem 5.6 —Calculate the radii of curvature of electrons of velocities: 
(a) 0-3c; (4) 0-8c; (c) 0-99c, and (d) 0-999c in a magnetic field of strength 
1 Wb/m 2 (or 10,000 G). 

Problem 5.7— Calculate the radius of curvature of a proton of velocity 
0-lc in a magnetic field of strength 1 Wb/m 2 (or 10,000 G). 

Problem 5.8— Describe a (3-ray spectrometer capable of focusing electrons 
of energies up to 3 MeV. . 

Problem 5.9— Discuss the physical principles underlying the following 
accelerators: (a) cyclotron; (4) synchrocyclotron; (c) electron synchrotron; 
(d) electron linear accelerator (Reference: McMillan 6 ). 

Problem 5.10 — A particle of rest mass M moving with velocity u collides 
with a stationary particle of rest mass m. If the particles stick together show 
that the speed of the composite ball is equal to u[yM[(yM + mj] where 
y _ 1/(1 _ u 2 [c 2 ) I. (Hint: Proceed as in Section 5.7., but allow for the 
different masses of the colliding particles.) 
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Problem 5.11 —A rocket propels itself rectilinearly through empty space 
by emitting pure radiation in the direction opposite to its motion. If V 
is its final velocity relative to its initial rest frame, prove that the ratio of 
the initial to the final rest-mass of the rocket is given by 


Mi _ p + Fl* 
M f [c - V_ 


(Rindler 1960) 


Problem 5.12 —A hundred /^-mesons of rest mass 206 electron masses and 
energy 4-75 GeV are produced at an altitude of 30 km. If the mean life 
of/i-mesons at rest is 2-2 x 10~ 6 sec, calculate how many of the ^-mesons 
should reach sea level (a) allowing for time dilation, and ( b) neglecting 
time dilation, if they travel vertically downwards without ionization loss. 

Problem 5.13 —The diameter of our galaxy is about 10 5 light years. How 
long does it take a proton (in the proton rest frame) to pass through the 
galaxy if its energy is (a) 10 ls eV; (b) 10 17 eV; (c) 10 19 eV? 

Problem 5.14 —Calculate the amount of work in MeV that must be done 
to increase the velocity of an electron (a) to half the velocity of light (b) to 
three quarters the velocity of light. What is the ratio of the relativistic mass 
to the rest mass in each instance ? 

Problem 5.15 —‘Through what potential must an electron fall if, according 
to Newtonian mechanics, its velocity is to equal the velocity of light? 
What speed does the electron actually acquire according to the theory of 
special relativity ? 

Problem 5.16 —By what fraction does the mass of water increase, due to the 
increase in its thermal energy, it if is heated from 20°C to 50°C ? 

Problem 5.17 An incident photon of energy hv is scattered through an 
angle of 180 degrees in a Compton-type collision with a stationary electron. 
Calculate the total energies after the collision of ( a) the electron (b) the 
photon. [Hint: Apply the laws of conservation of total energy and of linear 
momentum in the lab.-system. Check your answer using eqns (5.90) and 
(5.93).] 

Problem 5.18 —Show that, if the electron recoils at an angle <f> to the 
direction of motion of the incident photon of energy hv in a Compton-type 
collision, then its kinetic energy is given by 


T-fr - Nacosy 

(1 + a)* - a 2 cos 2 * 

u hv 

where a = —~ 2 . [Reference: Semat, Foundations of Atomic and Nuclear 

Physics, 3rd Edition, Appendix 7. Check the answer using eqn (5.93).] 
Problem 5.19—A particle of rest mass m x having kinetic energy 7\ collides 
with a stationary particle of rest mass m 2 . If m 1 is scattered through an 
angle of 90 degrees and has momentum p[ after the collision, show 
that, after collision, m 2 has momentum {p’\ + 27> 1 + Tf/c 2 )! and makes 
an angle 0 4 with the direction of the incident particle equal to 
tan- 7 {/>[/(27>, + Tf/c 2 )*}. 
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Problem 5.20 —Calculate the total binding energy of one helium nucleus, 
if the mass of a hydrogen atom is 1-00814 a.m.u., that of a neutron 
1-00898 a.m.u. and that of a helium atom 4-00388 a.m.u. (The masses 
are measured on the la O scale.) 

Problem 5.21 —Calculate the energy released in the following reactions: 

(a) *Li + *H — 2|He 

(b) »N + *He - + JH 

(Hint: Look up accurate tables of a.m.u. and estimate the mass difference.) 

Problem 5.22 —The earth receives solar energy at the rate of 1-35 X 
10 3 W/m 2 . If the distance of the earth from the sun is 150,000,000 km, find 
(a) the total mass lost by the sun per second; (b) the pressure of solar 
radiation on the earth assuming the earth is a black body. 

Problem 5.23 —Describe how the aberration of light from stars and the 
Doppler effect can be interpreted in terms of the momentum and energy 
transformations for photons. 

Problem 5.24 —Show that the rest mass of a particle is given by 

p 2 c 2 - r 2 
m n -— n 

0 27c 2 

where p is its momentum and T its kinetic energy. Calculate the rest 
mass of a particle if its momentum is 130 MeV/c when its kinetic energy is 
50 MeV. 

Problem 5.25 —An unstable particle of rest mass M 0 and momentum p 0 
decays in flight into two particles of rest masses M x and M 2 , momenta p x 
and p 2 and total energies E x and E 2 , respectively. Show that 

My = (M x + M 2 ) 2 <* + 2 E x E 2 - 2M x My - Ipipy cos e 

where 8 is the angle between the two decay particles. [Hint: Apply 
conservation of momentum and of mass-energy in the lab.-system.] 

Problem 5.26- —An unstable neutral particle decays into two charged 
particles ofkinetic energies 190 MeV and 30 MeV and momenta 300 MeV/c 
and 240 MeV/c respectively. Determine the masses of the decay products. 
If the angle between the decay particles is 45 degrees, determine (a) the 
rest mass of the neutral particle, ( b ) its momentum, and (c) its kinetic 
energy. [Use m 0 = (p 2 c 2 — T 2 ) /2 Tc 2 to determine the masses of the decay 
products. Apply conservation of total energy and of momentum in the 
lab.-system to determine the energy and momentum of the parent particle; 
hence determine its mass.] 

Problem 5.27 —Determine the maximum kinetic energy in MeV, of the 
electron arising from fi- decay if 

ft ->■ e + v + v 

The rest mass of the /t-meson is 206-8 electron masses, and the masses of 
the neutrino and antineutrino may be taken to be zero. 
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Problem 5.28 —A w "-meson of rest mass m 0 , velocity u, y = (1 _ u 2 [ c t\-i 
decays in flight into two photons. If one of the photons is emitted at an 
angle 6 to the direction of motion of the 7r°-meson in the laboratory system 
show that its energy hv is given by 


hv = m 0 c 2 /{2y(l — u cos 0/e)} 

Problem 5.29 A particle of momentum p 1; rest mass m x , is incident upon a 
stationary particle of rest mass m 2 . Show that the velocity of the c.m.-system 
is equal to 

Pi* 8 __ _ PjC 2 

( £ 1 + E g ) v'Xpfc 2 + m *(4) + 

Problem 5.30— Show that if a proton of velocity u, y = (1 u 2 jc 2 )~I 

collides with another proton, the velocity of the centre of mass is given by 


Show also that 


w c.m. 


yu 

(y + i) 


^c.m. 


v + n* 
2 


[Hint: Use eqn (5.80).] 

Problem 5.31 A particle of rest mass m x , velocity u and total energy E, 
collides with a stationary particle of mass m a . Show that in the extreme 
relativistic case, u === c, the energy available in the c.m.-system is approxi¬ 
mately (2£ 1 m 2 c 2 )I. Show that the velocity of the c.m.-system is 



Use et l n (5.121) to get the energy in the c.m.-system, and use 
(5.80) to determine the velocity of the c.m.-system.] 

Problem 5.32 A photon collides with a stationary proton to produce a 
7r+-meson according to the reaction 

y + p ~*n + n+ 

Find the threshold frequency v 0 for this process to take place. You may 
neglect the proton neutron mass difference. The rest mass of the v+-meson 
is 273-2 electron masses. 

If the mean lifetime ofair+-meson in its rest frame is r, obtain an expression 
for the mean life in the lab.-system of a rr+ created by a photon of frequency 
V Derive and use eqn (5.121) to show that the threshold kinetic 

energy is 150 MeV corresponding to a frequency of 3-6 X 10 22 c/sec. 
Calculate the velocity of the c.m.-system and calculate the dilation of 
lifetimes relative to the laboratory.] 

Problem 5.33 A photon incident on a hydrogen target produces a 
it -meson (of rest mass 273-2 electron masses) according to the reaction 

y + p -*■ n + 7T+ 
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If the 7r + -meson has a kinetic energy of 50 MeV, and is emitted at an angle 
of 90 degrees to the direction of motion of the incident photon, calculate 
the momentum of the ir + -meson in the lab.-system, and the energy of the 
incident y-ray. 

Problem 5.34 —Show that, if total energy and momentum are to be 
conserved in pair production by a photon, the process cannot take place 
spontaneously in a vacuum. 

If a photon strikes a stationary electron giving rise to an electron-positron 
pair as well as a recoil electron, show that the threshold for the reaction is 
4m 0 c 2 , where m 0 is the rest mass of an electron. [Hint: Assume for purposes 
of discussion that pair production takes place in a vacuum. The minimum 

hv . 

electron momenta would be — , when both electrons were emitted in the 

Lc 

direction of the incident photon. For this case the sums of the total energies 

( /M 2 \* 

m\<* + 1 — 1 c 2 J which would be greater 


than hv, so that energy would not be conserved. For the second part of 
the question use eqn (5.124).] 

Problem 5.35 —Show that in positron annihilation at least two photons 
must be produced if linear momentum is to be conserved. Calculate the 
energies of the two photons, if the positron and electron are at rest when 
they annihilate each other. 

Problem 5.36 —Calculate the threshold kinetic energies, in MeV, for the 
following processes 


{a) y + p -+ p + re 0 

( b ) 7r -f- p —► + rf~ -f- n 

W P+P-*P+P + P + P 


The rest masses of p, w+ and ir° are 1836, 273 and 264 electron masses 
respectively. [Hint: Derive and use eqn (5.124).] 

Problem 5.37 —A stationary nucleus of mass M is in an excited state of 
excitation energy E 0 . It emits a photon in a transition to the ground state. 
Show, using relativistic equations, that the frequency of the photon, after 
allowing for recoil, is 


v 



JV 

Me 3 


where h is Planck’s constant. [Hint: Derive and use eqn (5.116).] 

Problem 5.38 —In the collision of a very high energy proton, of velocity 
«„ and y = (1 — u„[c 2 )~i 1 with a stationary proton, 77-mesons are 

emitted symmetrically in the c.m.-system with velocity a', that is the 
number of 77-mesons in the backward direction is equal to the number in 
the forward direction in the c.m.-system. Show that the angle in the 
lab.-system containing half the emitted 77-mesons, is given by 


tan 


u' ^ u' 2i 
Yc.mMc.rn. ~ 
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[Him: Consider a ir-meson emitted at 90 degrees in the c.m.-system and 
ftnd its direction in the Iab.-system.] 

Problem 5.39— A particle of charge q, rest mass m 0 , is projected from the 
origin with velocity u g along they axis. If there is a uniform electric field 
E parallel to the * axis, show that the orbit of the particle is a catenary 
, W a , qEy 

given hyx — ~ cosh —, where p 0 is the initial momentum, W a = (m-jc 4 + 

c 2 ^)l is the initial total energy of the particle and E is the electric intensity 
vector. [Hint: The equations of motion are jp x — qE and p y = 0. Inte¬ 
gration gives p x = qEt and p y = p 0 . Now if W is the total energy at a time t 

x = PA\ W = qEt<?!{Wl + cYEV)i 

and } = pYiw = pyuwi + cy&tY 

Integrate with respect to t and eliminate /.] 

Problem 5.40—What changes must be made to non-relativistic electron 
optics so that the theory can be applied to particles having velocities 
comparable to the velocity of light? 

A large parallel plate capacitor, in vacuo, is charged to a potential differ¬ 
ence of one million volts. The plates are 5 cm apart. Calculate how long it 
will take an electron starting from rest to travel from the negative to the 
positive plate. The charge and rest mass of the electron are 1 -60 x 10— 19 C 
and 9-11 x 1(C 31 kg respectively. The velocity of light is 3 x 10 8 m/sec 
[Hint: Derive and use eqn (5.37).] 

Problem 5.41 A particle P of rest mass m 0 is attracted towards the origin 
0 of an inertial frame by a Newtonian force of magnitude m g fijr i where 
r = ^ anda constant. Prove that the motion of the particle is confined 
to a plane and that the differential equation of the orbit in terms of polar 
co-ordinates r, 8 in the plane is 



where u ~ 1/r, and h = r 2 dd/ds is the (constant) relativistic angular 
momentum per unit mass. 

Show that by choice of initial line the solution of this equation may be 
written in the form 

lu — 1 + e cos kO 

where k = (1 — and /, e are constants. Describe the motion 

briefly for the case n <f, hc,e < 1. (Exeter 1960) [Reference: W H 
McCrea, Relativity Physics, p. 29.] 
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GEOMETRICAL REPRESENTATION OF 
LORENTZ TRANSFORMATIONS AND 
THE USE OF 4-VECTORS 

6.1. GEOMETRICAL REPRESENTATION OF THE 
GALILEAN TRANSFORMATIONS 


The position of a particle in three dimensions can be specified by the 
x, y and z co-ordinates of the point. If the particle is moving, then 
these co-ordinates vary with time. It is often convenient to represent 



(a) (b) 

Figure 6.1. (a) The displacement of the particle P along the x axis of an inertial frame 27 is 
represented by the curve NR in the Ox, Ot rectangular co-ordinate system. The displacement 
of P relative to 27' can be represented by the same curve NR, if Ox' and Ot' are chosen as 
oblique axes to represent the displacement of P relative to 27'. A new length must be chosen to 
represent the unit of time in the oblique co-ordinate system. All that must be given are the 
‘world lines' such as NR, the origin 0, the x axis, the length of the unit of x and the time 
calibration line AD which are all shown by continuous lines in ( b). From this data a suitable 
co-ordinate system can be chosen 

this motion graphically. For simplicity we shall start by considering 
motion along the * axis only. Choose a rectangular co-ordinate 
system to represent the displacement of the particle relative to the 
inertial frame E. The displacement of a particle P along the x axis 
of E will be plotted against the time t as shown in Figure 6.1. Let 
the motion of a particle P relative to E be represented by the curve 
NR, and let the particle be at the position R having a displacement 
x at a time t. 

188 


~Piifuc±- Tfeat/tewKitiea/ 



REPRESENTATION OF THE GALILEAN TRANSFORMATIONS 

f™ W ^ 0nS i de u thC m ° ti0n ° f the Same P article from an inertial 
trame Z , which is moving with uniform velocity o relative to 27 

along the common * axis. If the origins of 27 and 2’ coincide at 
0, then according to the Galilean transformations one has 
x' = x — vt; t’ — t 

Since *' = 0 at the origin of 2', the motion of the origin of 2’ 
relative to the origin of 2 is given by the line * = vt. This is repre¬ 
sented by the line OQ in Figure 6.1. The lines Ox and OQ will now 
be used as oblique axes Ox' and Ot' respectively to represent the 
motion of the particle P relative to the inertial frame 2’. We shall 
start by determining the *' and f co-ordinates of the point R 
without associating them in any way, at the moment, with the 
motion of the particle P. In order to get the *' co-ordinate of the 
pomt i?, draw a line parallel to the Ot' axis to cut the Ox' axis at 
~ , * c °-° rdl nate of the point R can be represented by OS. 

im ilar ly, with a suitable choice of scale, the intercept of the line 
RU on the Ot axis can be used to represent the t' co-ordinate of the 
point R m the oblique co-ordinate system. Relative to the original 
ectangular axes Ox, Ot, which were used to represent the displace¬ 
ment of the particle P relative to 2, 


UQ — vt and UR = * 

OS = QR = UR — UQ — x 


vt 


But, according to the Galilean transformations, the *' co-ordinate 
m Z , corresponding to the * co-ordinate at a time t in 2, is given by 

x' = X — vt 

Therefore, if the same scale is used along the Ox' axis as along the 

r?F? S, / ; ' S > Can bC U ™ d t0 re P resent th e displacement of the 
particle T relative to 2 , when the particle P is at the point R 
having a displacement * at a time t relative to 2. The point R 
need not be displaced in the x' direction in order to represent the 
displacement of the particle P, relative to 2'. 

In Newtonian mechanics time is absolute so that time intervals 
must be measured to have the same numerical value in both 2 and 

tt ‘ n ?!. l 'T re d '/. the hmgth OQ is greater than the length OU. 
Hence, if the position of the point R relative to the Ox' axis is to 
represent the displacement of the particle P at a time t' =-. t in 2' 
then a new time scale must be chosen for the oblique co-ordinate 
system Ox Ot . Draw the line ABCD corresponding to the time 
i t , e orl g ma l rectangular co-ordinate system Ox, Ot The 
length OB represents the unit of time in 27. If OC is chosen as the 
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new unit of time in the oblique co-ordinate system Ox', Ot', then 
the time t’ represented by the point R in the Ox', Ot' co-ordinate 
system is equal to the time t represented by the point R in the 
co-ordinate system Ox, Ot. Thus using the oblique co-ordinate 
system Ox', Ot' to represent the displacement of the particle P 
relative to E', and using a new time scale, the position of the point 
R need not be changed in Figure 6.1, and the curve NR can be used 
to represent the displacement of the particle P relative to both 
E and E'. 

An oblique co-ordinate system, with the same x axis as E, but a 
different time axis and a different time scale, can be chosen to 
represent the displacement of the particle P relative to any other 
inertial frame which is moving with uniform velocity relative to E 
along the x axis, whose origin coincides with the origin of E at 
t = 0. We need not have started with a rectangular co-ordinate 
system, but could have started with oblique axes. The unit of time 
is cut off on all the time axes by the line AD. 

If the particle P moves in two dimensions in the xy plane, the 
motion can be represented in a 3-dimensional co-ordinate system 
x, y and t. For motion in three dimensions one would have to use 
the four dimensions x, y, z and t to represent the displacement. It 
should be emphasized that there is nothing more unusual about 
using the four dimensions x, y, z and t to represent the motion of the 
particle in the theory of special relativity, than in using the same 
number of dimensions to represent the same co-ordinates and time 
in Newtonian mechanics. Provided the length of the unit of time 
is chosen appropriately, the curve NR, which represents the dis¬ 
placement of the particle P, is the same for all the co-ordinate 
systems. It is called the ‘world line’ of the particle P. If the 
particle P moves in a straight line with uniform velocity relative to 
an inertial frame, the ‘world line’ is a straight line. If the particle 
is accelerated its ‘world line’ is curved. If the ‘world line’ NR and the 
origin O are given, then for motion in the x direction, one only needs 
01 the unit of length in the x direction and the line AD as shown 
in Figure 6.1(b). The unit of time for any co-ordinate system can be 
calculated, if the line AD is given. The x axis is common to all the 
possible co-ordinate systems representing the various inertial fram.es 
moving with uniform velocities relative to each other and whose 
origins coincide at t = 0. In order to select a suitable time axis 
draw any line from 0 to cut the line AD as shown in Figure 6.1(b). 
The unit of time for this co-ordinate system is represented by the 
distance from O to the point of intersection of AD and Ot'. The 
velocity of this inertial frame relative to E is given by v — tan <f>. 
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6.2. GEOMETRICAL REPRESENTATION OF THE 
LORENTZ TRANSFORMATIONS* 

We shall again start by considering the motion of a particle P 
a ong the x axis of the inertial frame 27, and start with a rectangular 
Cartesian co-ordinate system to represent the displacement of the 
particle P relative to 27. Instead of plotting x against t, x will be 
plotted against ct, where c is the velocity of light. This gives the 




m=0 


(a) 


(b) 


Figure 6.2. (a),Tfu: displacement of theparticle P relative to the inertial,frame Z is represented 
by the curve NR in the Ox Om rectangular co-ordinate system. The lines Ox' and Om' 
which are inclined at angles <f>< = tan 1 vjc to the Ox and Om axes respectively can be used as 
oblique axes to represent the displacement of the particle P relative to the inertial frame Z'. 

iZV C r, r S T g ‘& are ch P en / or the mits of *' and m’, then the displacement of the 
particle P relative to E can also be represented by the same curve NR, (b) The pauses 
appropriate to E and E' 


ordinate axis the dimensions of a length. Let ct be denoted by m. 
This diagram is called a space-time or Minkowski diagram (cf. Section 
3.9 and Appendix 6 h). 

Let the motion of the particle P relative to 27 be represented by the 
curve NR, as shown in Figure 6.2(a). We proceed now to see if 
the same curve NR can be used to represent the displacement of the 
particle P relative to an inertial frame 27', which is moving with 
uniform velocity v along the x axis relative to 27. From the Lorentz 
transformations 


*' = Y( x ~ vt) ( 6 . 1 ) 

t’ = y(t — vx j c 2) (6.2) 


* S ° me rea u der 7 ma V P re |" to proceed to Section 6.3 before reading Section 6.2. 
f, PP. roach adopted m 6.3 is based on the co-ordinates x, y, z, i ct. The latter 
method is a complete alternative to the method developed in Section 6.2 and is 
mathematically the simpler of the two. 
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Relative to the rectangular axes Ox, Om the lines t' = m! — 0 and 
x' = 0 are given by the equations m = ct = vx/c and x = vt = vmjc 
respectively. These lines are represented by Ox' and Om' in 
Figure 6.2(a). The line Ox' makes an angle <f> with the Ox axis and 
the line Om' makes an angle <f> with the Om axis and an angle of 
(ir/2 — <f>) with the Ox axis, as shown in Figure 6.2(a), where 
tan <f> = v/c. The same plan will be followed as in Section 6.1, and 
Ox' and Om' will be used as oblique axes to represent the displace¬ 
ment of the particle P relative to £'. Unlike Newtonian mechanics, 
in addition to the time axis, the direction of the co-ordinate axis is 
also changed, so that one must be prepared to change the unit of 
length as well as the unit of time in the Ox’, Om' co-ordinate system. 
The unit of length in the Ox', Om’ co-ordinate system is given by the 
distance of the point x' = 1 ,m — ct' = 0, from the origin as shown 
in Figure 6.2(b). Relative to the original Ox, Om rectangular co¬ 
ordinate system, this point has co-ordinates 


x = y(x' + vt') = y( 1 + y0) = y (6-3) 



The distance of this point from the origin, relative to 2, is equal to 
g( = V* 2 + m 2 = V* 2 + cV 

g{ = yV 1 + v 2 /c 2 = A / [ 1 1 ' * V JjJj (6.5) 

If lengths along the Ox' axis are measured using the scale (or ruler) 
appropriate to the Ox, Om rectangular co-ordinate system, then in 
order to convert these measured lengths into lengths appropriate 
to the Ox', Om' co-ordinate system, the measured lengths must be 
divided by g' v The quantity g( is sometimes called the gauge of the 
co-ordinate system representing the displacement of the particle P 
relative to 2'. 

Similarly, the distance of the point m' = ct’ = 1; x' = 0 from the 
origin of Figure 6.2(b) gives the length of the unit of m! relative to the 
rectangular co-ordinate system Ox, Om. Using the Lorentz trans¬ 
formations corresponding to m' = 1, x' = 0 in the oblique axes 
Ox', Om', in the rectangular co-ordinate system Ox, Om one has 
x = yv/c and m = y so that 

g* = yVl + f 2 /r 2 (6.6) 

Lengths measured along the Om axis using the scale (or ruler) appro¬ 
priate to the rectangular axes Ox, Om must be divided by g' z in order 
to obtain the values of m' appropriate to the oblique axes Ox', Om’. 
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The co-ordinates of the point R shown in Figure 6.2(a) will now 
be calculated relative to the oblique axes Ox', Orri. For the present 
the point R will not be associated in any way with the displacement 
of the Particle P relative to Z'. Draw lines through R parallel to the 
Ox and Om axes to cut lengths OT and OW on the Om' and Ox' 
axes respectively as shown in Figure 6.3. Let these be measured, 



Figure 6.3. A more detailed form of Figure 6.2(a) 

initially, using the scales (or gauges) of the rectangular co-ordinate 
system Ox, Om. Now, in Figure 6.3 

OV = x; UV — m tan cf> = vmlc — vt 

Hence, 

OU=OV- UV= (x- vt) 


In the triangle OUW, the angles WOU, OUW and OWU are equal 

to <p, 7T/2 + </> and 77/2 — 2 <f> respectively as shown in Figure 6.3. 
Hence * 


OW 


OU 

WU 


(6.7) 

sin ( 77/2 + 

Therefore 

= 

sin ( 77/2 — 

2 <j>) sin <j> 


OW — 

OU cos <f> (x — 

vt) cos <f> 


(6.8) 

Since, 

cos 

2 <j> cos 2<j> 


tan <j> = vjc, < 

:os ^ 

c 

1 




V c 2 + V 2 

Vl + v 2 /c 2 



cos 2<f> = 2 cos 2 (f> 

- 1 

2 

■ i-^ 1 - 

V 2 fc 2 ) 



(1 + v 2 jc 2 ) 

(1 + 

v 2 jc 2 ) 
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Substituting in eqn (6,8) one obtains 

(* - *Q/( 1 + v 2 lc 2 ) +i (x-vt)(l+ 

UVy ~ (1 - z/ 2 /c 2 )/(l + v 2 /c 2 ) (1 - V^C 2 ) 


(6.9) 


This is the length of 0 W measured using the scale appropriate to the 
rectangular co-ordinate system Ox, Om. In the oblique co-ordinate 
system Ox', Om' the unit of length must be changed and the length 
OW given by eqn (6.9) must be divided by the gauge 

g[ = yO + ^A 2 ) 4 

Hence, relative to Ox', Om', the distance OW represents a length 
equal to 

(jv — wf) (1 + d 2 /c 2 )* _ (1 + v 2 jc^ (x — vt) 

(I _ p*/ C ») ^ (1 - v 2 /c 2 )i “ Vl - V 2 jc 2 


According to the Lorentz transformations, when the particle P is 
at the point x at a time t relative to 27, its displacement relative to 
27' is equal to 

x = y{x — vt) 


Hence, using the new choice of gauge given by eqn (6.5), OW is 
equal to the x' co-ordinate of the particle P in 27', corresponding 
to the event at R having a co-ordinate x at a time t in 27. 

In Figure 6.3 using the scales appropriate to the rectangular axes 
Ox, Om, 


OT — WR = RU — 


WU = 


m 

cos <f> 


-WU = 


ct 

COS <j) 


From eqn (6.7), 

OU sin if> 


WU = 


sin (i - 2 ^) 


Of/sin <f> 
cos 2<f> 


wu 


(* “ Vt) ( C 2 + V 2)t ( X ~ Vl) ~C (1 + "W 


(1 - v 2 /c 2 ) 


(1 - v 2 /c 2 ) 


(1 + v 2 /c 2 ) 

(x — vt) - (1 + v 2 }c 2 ) k 

OT = RU — WU — ct(l -j- v 2 lc 2 )* 

(1 + V 2 jc 2 ) I 


OT = 


(1 - v 2 jc 2 ) 


(- 7 ) 


(1 - v 2 /c 2 ) 


( 6 . 10 ) 
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This is the length of OT measured using the scale appropriate to 
the rectangular co-ordmate system Ox, Om. In order to find the 
Jmgth this distance represents in the oblique co-ordinate system 

whi'ch by } mmt diVided by the >PP r °P riaK gauge, 

' " ( 6 - 6 ) 


g' 2 = yVl + o 2 /c 2 

Dividing the right-hand side of eqn (6.10) by g’ 2 , one obtains 

(1 + yZ/c 2 )* / daA -— / vx \ 

(1 -r 2 /c 2 ) \ Ct ~ 7) ^ yA/l + v ! c% = y\ct- — j 

Relative to the oblique co-ordinate system Ox', Om' the distance OT 
represents a length y(ct - vx/c). According to the Lorentz trans¬ 
formations corresponding to the time t, when the particle P has a 
displacement at in Z, one has in Z' 


ct 


,/ / . vx\ 

= y[ ct -7) 


Hence, with the new choice of gauge for m', the m' co-ordinate of the 
point R is exactly equal to the value predicted by the Lorentz 
ransformations for the particle P at the time t when it has a dis¬ 
placement * in Z. Thus by choosing Ox' and Om' as oblique axes 
o represent the displacement x' of the particle P at a time t' in Z' 
corresponding to the displacement rata time t in Z, and using the 
new gauges given by eqns (6.5) and (6.6) respectively, the position 
of the point R on the graph shown in Figure 6.2(a) need not be 
changed. This conclusion holds whatever the relative velocity 
between the two inertial frames, provided the relative velocity is 
constant If the displacement of the particle P relative to the 
inertial frame Z is represented by the curve NR in the Ox, Om 
rectangular co-ordmate system as shown in Figure 6.2(a), then 
provided the scales (or gauges) for *' and ct' are chosen according to 
eqns (6.5) and (6.6), then the displacement of the particle P relative 
to the inertial frame Z' is represented by the same curve NR in the 
oblique co-ordinate system Ox', Om'. This holds for all the inertial 
frames moving with uniform velocity along the at axis relative to Z 
whose origins coincide with each other at t = 0. Once the axes 

reP [ eS ^ ting ° ne inertial frame are chosen, the path NR is fixed. The 
path NR is called the ‘world Hue’ of the particle P in the two dimen- 
sional ac, ct world’. We started with rectangular axes in order to 
simplify the discussion. This was not necessary; one could have 
started with oblique axes. 

Some of the geometrical relationships between the scales (gauges! 
corresponding to the various inertial frames are now discussed. 
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For the inertial frame E' the length OA' from the origin to the point 
A' (having co-ordinates x' = 1, m! = 0) in Figure 6.4(a) gives the 
length of the unit of*' along the Ox' axis. According to the Lorentz 
transformations, relative to the rectangular co-ordinate system 
Ox, Om, used to represent the displacement of the particle P 
relative to E, the point A' must have the co-ordinates x = y, 
m = ct = y(v/c). For the point A’ 

x 2 — m 2 = x 2 — cH 2 = y 2 — y 2 v 2 jc 2 = y 2 (l — v 2 Jc 2 ) = +1 

The expression (x 2 — m 2 ) is independent of v, the relative velocity 
of E’ and E. Relative to the rectangular co-ordinate system Ox, Om the 



Figure 6.4. In (a) the calibration hyperbolae are shown. These enable the unit of length and 
time in any co-ordinate system such as Ox', Om' to be determined. The unit of x' for the 
Ox' axis shown in {a) is equal to OA', and the unit of m! is equal to OB'. The positions 
of the ‘ world lines’ relative to the calibration hyperbolae are independent of the choice of any 
particular co-ordinate system. All that must be given are the ‘world lines' and the calibration 
hyperbolae shown by full lines in (b). The asymptotes can then be drawn and a co-ordinate 
system such as Ox', Om' chosen 

point A', whose distance from the origin gives the unit of length for 
the co-ordinate system used to represent the displacement relative 
to E', always lies on the curve 

x 2 — m 2 = x 2 — c 2 t 2 = G = +1 (6.11) 

whatever the value of v. If the relative velocity between E' and 
E were different, the position of the point A' which has co-ordinates 
x = y, m = ct — y(vjc), would vary since v and y would vary, but 
A' would always be on the curve x 2 — m 2 = +1 relative to E. Eqn 
(6.11) is the equation of a hyperbola. It is illustrated in Figure 
6.4(a). The unit of length for any oblique co-ordinate system Ox', 
Om! is given by the distance from the origin to the point of inter¬ 
section of the Ox' axis and the hyperbola corresponding to G — +1. 
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The unit of m is given by the length OB', where B' corresponds 
to the point x — 0; m' = ct' — 1 in Figure 6.4(a). According to the 
Lorentz transformations the x and m co-ordinates of B' are y(v/c) 
and y respectively. For the point B', relative to the rectangular 
co-ordinate system Ox, Om one has 



Thus B' always lies on the hyperbola 

x 2 — m 2 = G = — 1 (6.12) 

whatever the value of u. The unit of tn' in 2' is given by the 
distance from the origin to the point of intersection of the Om' axis 
with the hyperbola given by eqn (6.12). The two sets of hyperbolae, 
given by eqns (6.11) and (6.12) respectively, are known as the 
calibration hyperbolae. 

If new co-ordinates rj = x — m and £ = x -f- m are introduced, 
then the hyperbolae take the form £r) = ±1. The lines rj = 0, 
£ = 0 are the asymptotes to the hyperbolae, the line rj = 0 corre¬ 
sponding to x = ~\-ct and the line £ = 0 corresponding to x — — ct. 
These are the ‘world lines’ of light signals leaving the origin in the 
positive and negative x directions respectively at a time t = 0. 
These particular ‘world lines’ were introduced in Section 3.12 where 
it was shown that they separate events which can and cannot be 
causally connected with the event at the origin at the time 1 = 0 
(compare Figure 3.14). The line x = -\-ct = -\-m bisects the angle 
between the Ox and Om axes. Since the Ox' axis is inclined at the 
same angle <f> to the Ox axis as the Om axis is inclined to the Om 
axis, the line x = -f ct corresponding to jj = 0 also bisects the angle 
between the Ox' and Om' axes. 

In order to calculate the calibration curves we started with 
rectangular axes; this was not necessary, we could have started with 
oblique axes. All that need be given are the calibration hyper¬ 
bolae and the world lines as shown in Figure 6.4(b). The asymptotes 
to the calibration hyperbolae can then be drawn. In order to select 
a suitable co-ordinate system, one need only draw any straight 
line from the origin to cut the hyperbola x 2 — m 2 = G = +1, say 
at the point A' as shown in Figure 6.4(b) ; this line serves as the Ox' 
axis and OA' is the unit of length for this particular co-ordinate 
system. To obtain the corresponding Om' axis draw a straight line, 
from the origin, on the other side of the asymptote rj — 0, and at the 
same inclination to it as the Ox' axis has, as shown in Figure 6.4(b). 
This line is the appropriate Om' axis. The unit of m' is equal to 
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OB', where B' is the point of intersection of the Om' axis with the 
calibration hyperbola x 2 — m 2 = G = — 1. Any such oblique 
co-ordinate system, obtained in this way, can be used to describe 
the course of nature. The world lines are the same for all of them. 
According to the principle of relativity the laws of physics should 
be the same for all these inertial frames of reference. 

If the particle P moves in the xy plane one would have to use 3 
dimensions to represent the displacement of the particle at the time t. 
For motion in 3 dimensions one would have to use 4 dimensions 
corresponding to x,y, z,m = ct. Since y' = y and z! = z no change 
of gauge is necessary in thej and z directions and the Oy and Oy' 
axes are coincident, and so are the Oz and Oz axes. Minkowski 
(1908) suggested that the 4-dimensional x, y, z, ct space represents 
the ‘world.’. In this ‘world’ events or ‘world points’ are specified by 
the four co-ordinates x, y, z, ct. The motions of particles are repre¬ 
sented by ‘world lines’. The motion of a particle moving with 
uniform velocity is represented by a straight ‘world line’, accelerated 
motion is represented by a curved ‘world line’. These ‘world 
points’ and ‘world lines’ are the same for all the co-ordinate systems 
used to represent the displacements of particles in the various inertial 
frames moving with uniform velocity relative to each other and 
whose origins coincide at t = 0. In Newtonian mechanics the time 
interval between events and the lengths of objects are absolute. 
According to the theory of special relativity the numerical values 
of lengths and time intervals are not absolute, but depend on the 
inertial reference frame chosen to represent the motions. However, 
it was shown in Section 3.12 that the interval between ‘events’ is 
absolute, that is, has the same numerical value in all inertial frames 
of reference. The interval <5s was defined by the relation 

Ss 2 = dx 2 + dy 2 + dz 2 - c 2 dt 2 (3.39) 

Thus, though the spatial and temporal intervals between two events 
are not absolute, according to the theory of special relativity a 
certain combination of the two, namely the interval between the 
events, is absolute. So are the ‘world points’ and ‘world lines’. This 
is probably the way one should interpret the opening remarks which 
Minkowski made in his address delivered to the Eightieth Assembly 
of German Natural Scientists and Physicians at Cologne, on 21st 
September, 1908. In this address Minkowski introduced the geo¬ 
metrical interpretation of space and time for the first time. 
Minkowski said: 

The views of space and time which I wish to lay before you have sprung 

from the soil of experimental physics, and therein lies their strength. 
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They are radical. Henceforth space by itself, and time by itself, are 
doomed to fade away into mere shadows, and only a kind of union of the 
two will preserve an independent reality. 

In the 4-dimensional x, y, z , ct ‘world’ the kinematics of particles 
can be developed as a 4-dimensional geometry. There is an alter¬ 
native approa ch i n which the four variables x, y, z, i ct are used, 
where 1 = V—1. This approach is mathematically simpler and 
will be developed in Section 6.3. The development of relativistic 



Figure 6.5. x is plotted against t and not against m — ct as was 
done in Figure 6.4. As a result the diagram looks more like the 
corresponding case for Newtonian mechanics shown in Figure 6.1. 
1 he choice for the direction of the x' axis appears more restricted 
in Figure 6.5 than in Figure 6.4 


kinematics and relativistic mechanics in terms of 4-vectors will be 
discussed in Section 6.4. 

In Newtonian mechanics there is no choice of direction for the x' 
axis, once one set of axes is chosen. It appears at first sight, from 
Figure 6.4(b), that according to the theory of special relativity there 
is a very wide choice for the direction of the x' axis. The choice is 
exaggerated, since we chose to plot x against ct rather than against t, 
as was done in Figure 6.1, Section 6.1. Consequently, the ordinates 
m Figure 6.4(b) were magnified by a factor c compared with the 
ordmates in Figure 6.1. If x had been plotted against t. Figure 6.4 
would be more like Figure 6.5, which is still enormously exaggerated. 
The choice of direction for the x’ axis now appears very small. As 
c tends to infinity the equations of the theory of special relativity go 
over into the equations of Newtonian mechanics. This is true in 
the present case since, if c tends to infinity, the hyperbola corre¬ 
sponding to G = +1 tends to coincide with the * axis and the 
yperbola corresponding to G = — 1 tends to a straight line parallel 
to the x axis in agreement with Figure 6.1(b). 
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As examples of the use of the methods developed in this section 
the phenomena of length contraction and time dilation will be 
discussed. Draw the calibration curve G = 1 as shown in Figure 
6.6(a). Let a rod of unit length be at rest on the x axis of an inertial 
frame E, which will be represented by the rectangular co-ordinate 
system Ox, Om; the choice of rectangular axes is not essential. The 
positions of the ends of the rod are given by the ‘world lines’ Om 
and A A' respectively as shown in Figure 6.6(a). These ‘world lines’ 
are the same for all the possible co-ordinate systems. Let the axes 




Figure 6.6. The phenomenon of length contraction. In (a) a rod of unit length is at rest 
in Z, and the ‘world lines' of its ends are represented by OmandAA'. In ( b ) the rod is 
at rest in Z' and the * world lines' corresponding to the ends of the rod are represented 
by Om' and BB'. The ‘world line* BB' is a tangent to the calibration hyperbola at B 

appropriate to an inertial frame E', moving with velocity v relative 
to E, be Ox' and Om', such that tan cf> = vjc, as shown in Figure 6.6(a). 
The unit of x' for the oblique Ox', Om' axes is equal to the length 
OB. In order to measure the length of the rod in E', the positions 
of the ends of the rod must be observed at the same time t' in E', 
say at t' = 0. The distance between the ends of the rod at t' = 0 
is given by the distance between the points of intersection of the 
line m' = 0 with the two invariant ‘world lines’, that is by the 
length OC. But OC is less than OB the unit of length in E', so 
that the measured length of the rod in E' is less than unity, illustrat¬ 
ing length contraction. [A ctually in E', OC represents a length 
x = OCjg' 1 = sec 4>lg( — Vl— v 2 /c 2 .] 

If a rod of unit length is at rest along the x' axis of E', then the 
world lines of the ends of the rod are parallel to the Om' axis as 
shown in Figure 6.6(b) and are represented by the world lines Om! 
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and BB in Figure 6.6(b). The distance between the world lines in 
2 'ft the * = 0 is given by OE which is less than OA, the unit 

of length in X. This illustrates that length contraction is reciprocal. 

Now time dilation is illustrated. Draw the calibration curve 
“ 1 as shown in Figure 6.7(a). Let a clock at rest at the point 

x 9 ^ 0 in 27 emit ticks at t = 0 and t = 1 jc corresponding to 

m = 0 and m = 1 respectively. The ‘world points’ corresponding 
to these events are at O and D respectively in Figure 6.7(a). In order 
to measure the co-ordinates and times of these events relative to X', 



Figure 6.7. The phenomenon of time dilation, (a) Two events occurring at the point x = 0 
l f at J. lme ( m =0 and m — \ are represented by the ‘worldpoints’ 0 and D respectively. 
Inthe tnangU ODF, the angles ODF and OFD are n/2 + <f> and „/2 - 2<f> respectively, 
so that ; OF = OD cos d> sec 2<p, where tan = v/c. (b) Two events occurring at the origin 
of 2 , at m — 0 and m — \ are represented by the ‘ world points' 0 and E respectively 


draw the Ox' and Om axes appropriate to the relative velocity v, 
such that tan ef> = v/c. The unit of m! in X' is given by OE. In 
order to obtain the m' co-ordinates of the event at D, draw a line 
through D parallel to the Ox' axis to cut the Om' axis at F; thus 
OF represents the difference in m = ct' values between the events 
at O and D in the inertial frame X'. Now the unit of m' in 27' is 
given by OE. Since in Figure 6.7(a) OF > OE the time interval 
between the two events, which occur at the point x ~y = z ~ 0 in 
27, is longer in 27' than in X illustrating time dilation. [Actually, OF 
represents a value of m' in 27' equal to OF/g’ t = cos <£ sec 2<f>/g' 2 = y.] 
Let events corresponding to x' = 0, m' = 0 and x' — 0, m == 1 be 
at 0 and E respectively as shown in Figure 6.7(b). To obtain the 
m co-ordinate of the event at E draw a line parallel to the Ox axis 
to cut the m axis at H. The unit of m = ct in 27 is given by 0 D. 
Since OH > OD, the time interval is now longer in 27 than in X' 
illustrating that the phenomenon of time dilation is reciprocal. 
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6.3. THE USE OF THE COMPLEX VARIABLE X x = i ct 

In Section 6.2 only real variables were introduced. There is an 
alternative approach in which instead of time or m = ct, the variable 
X t = i ct is introduced, where i = V— 1. Let new variables be 
introduced such that in X 

Xy = x; X 2 = yi X 2 — z; X± = i ct 

Let the corresponding quantites in S' be 

x; = x'; X' = /; X' = z'; Z' = i ct' 

From the Lorentz transformations 


x' = y(x — vt) 


Hence X[ = 7 (*i - = 7 (*i + i \ * 4 ) 

X' = V 2 

K = *3 

Now t 1 — y(t — vxjc 2 ), or i ct’ — y (id — i ^x 

Hence, 

x; = r(x,- i^,) 

Eqns (6.13) and (6.16) can be rewritten in the form 
X( — Xy cos <f> + X 4 sin <f> 

X[ = — Xy sin cf> + X 4 cos <f> 

where 


tan i j> = i v/c; cos <f> — y; 


sin <f> = iy- 


(6.13) 

(6.14) 

(6.15) 


(6.16) 

(6.17) 

(6.18) 

(6.19) 


We shall again start by considering motion in the x direction only 
and X t will be plotted against X 1 using a rectangular Cartesian 
co-ordinate system as shown in Figures 6.8(a) and ( b ). Let the 
motion of the particle P relative to the inertial frame X be repre¬ 
sented by the curve NR. We shall proceed now to see how the 
motion of P relative to X' can be represented. The X( axis of X’ 
must be represented by the line X' K = 0. It follows from eqn (6.18) 
that the line X' t = 0 is given by XJX 1 == tan cf> relative to the OX v 
OX 4 co-ordinate system. This represents a straight line making an 
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angle <f> with the X x axis. Similarly, the X' axis in E' must be 
represented by the line X{ = 0. From eqn (6.17) it follows that, 
relative to the OX x , OX 4 co-ordinate system, the line X' = 0 is 
represented by the line XJX i = —tan <j>; this line makes an angle 
<p with. the X 4 axis and an angle ( 77/2 + <f>) with the X x axis as 
shown in Figure 6.8. In the present case the X x and X 4 axes are both 
rotated in the same direction when one transforms from E to E’, 
and the axes always remain at right angles to each other. 



(a) ( b ) 

Figure 6.8. (a) NR is the ‘world line’ in the X 4 = x, X 4 = ict space. A Lorentz trans¬ 
formation corresponds to the rotation of the X x , X t axes through the same angle </>. (4) The 
relationships between the co-ordinates of‘the world point * R relative to the new axes X{ Xi and 
relative to the old axes X 1 X i 

If there is no change of gauge (or scale), then the co-ordinates of 
the point R relative to the OX x , OX 4 axes are given by 

X x = X x cos <j> + X 4 sin <f> 

X 4 = — X x sin <f> -f X 4 cos <f> 

as illustrated in Figure 6.8(b). These components are precisely the 
same as those predicted by the Lorentz transformations for the 
X[ and X 4 co-ordinates of an event which has co-ordinates X x and 
in E. Therefore the point R can represent the displacement of the 
particle P relative to both E' and E without having to change 
the lengths of the units of either X' x or X 4 . The ‘world line’ of the 
particle P is again an invariant. The Lorentz transformation 
corresponds to a rotation of the axes in the X x X 4 plane. 

The displacement OR in the X x X 4 plane has the same transfor¬ 
mation properties as the displacement vector in ordinary two dimen¬ 
sional co-ordinate geometry. For this reason it is often stated that 
the X x and X 4 co-ordinates of the point P are the co-ordinates of a 
vector in the X x X 4 plane. If the particle P moves in three dimensions 
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then the four dimensions X ± , X 2 , X 3 , X 4 are necessary to specify its 
displacement at a given time. Since according to the Lorentz 
transformations X' 2 = X 2 and *3= *3, the Lorentz transformations 
can still be represented by a rotation of the axes in the X 1 X i plane. 
The four co-ordinates of the point R are said to form the com¬ 
ponents of a 4-vector. By introducing the co-ordinate X i = ict 
the geometry of the four-dimensional X 1 X 2 X 3 X 4 space has become 
‘Euclidean’. It should be stressed that this is a purely formal 



Figure 6.9. The illustration of the phenomena of time dilation 
and length contraction using the properties of the X 1 X i space 

representation, since the co-ordinate X t is imaginary and the 
rotation in the X 1 X i plane is through an imaginary angle given by 
tan cf> = iv/c. The use of complex numbers often simplifies the 
algebra. No deep metaphysical significance should be attached to 
the presence of V—1 in some of the equations of the theory of 
special relativity. It is introduced simply for reasons of mathematical 
convenience, just as the introduction ofj = V— 1 often simplifies 
calculations in alternating current theory. 

The phenomena of time dilation and length contraction can also 
be illustrated using the ideas developed in this section. Consider 
a clock at the origin of 27 which emits signals corresponding to 
AX 4 = 1 • These events are represented by a series of crosses along 
the X 4 axis as shown in Figure 6.9. In order to obtain the X 4 ' co¬ 
ordinates of these events draw a series of lines parallel to the X[ 
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axis. The distance between each cross on the X[ axis is equal to 
AX 4 cos <f>. From eqn (6.19), cos <j> = 1/(1 - v^/c 2 )*, hence 


that is 



i c At A t 

or At ' = vT^7j? 


This represents the phenomenon of time dilation. 

If a rod of length l 0 is at rest along the X x axis of the inertial frame 
27, the world lines of its end points are represented by the lines A A' 
and BB' in Figure 6.9. To determine the length of the rod in 27', 
the ends of the rod, which is moving with uniform velocity v relative 
to 27', must be measured at the same time in 27'. If the length of the 
rod relative to 27' is measured at a time X' = 0, then its length is 
given by the difference between the intercepts cut by the ‘world 
lines AA and BB on the X^ axis, that is, by DD' as shown in Figure 
6.9. We have 


DD' = l = l 0 sec <f> = l 0 



This gives the length contraction of a moving rod. It is quite 
straightforward to show again that both time dilation and length 
contraction are reciprocal. 


6.4. THE DEVELOPMENT OF THE THEORY OF 
SPECIAL RELATIVITY USING 4-VECTORS 

In elementary textbooks a vector is defined as a quantity which has 
both magnitude and direction. The most convenient way of repre¬ 
senting a vector is to use a co-ordinate system fixed in space. The 
directions of the axes of the co-ordinate system represent fixed 
directions in space. The vector can then be represented by its 
components along the various axes. If the components of a vector 
are given, then the magnitude of the vector, and its direction in 
space can be calculated. If the axes of the co-ordinate system are 
rotated in space, the vector itself must not be changed in either 
magnitude or direction. If this is to be the case, then the com¬ 
ponents of the vector must transform in a particular way when the 
axes are rotated. If the components of any quantity transform in 
the same way as the components of a vector, when the co-ordinate 
axes are rotated in space, then that quantity must be a vector. 
Thus it is convenient to define a vector in terms of the transfor¬ 
mation properties of its components, when the co-ordinate axes are 
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rotated. A 4-vector in the ZjZ 2 Z 3 Z 4 space will be defined as a 
quantity which transforms under a Lorentz transformation, in the 
same way as the X v X 2 , *3, X 4 co-ordinates of a point in the four¬ 
dimensional space. For example, A is defined to be a 4-vector if, 
under a Lorentz transformation, 

A 'i = y[ A i + i-A^;A l t = A 2 ; A' = A s ; A' = 7 (a 4 - i-A^j 

( 6 . 20 ) 

by analogy with the equations: 


x; = y(x \ + i- e Z 4 ) ; X’ = Z 2 ; X’ = Z 3 ; Z' = y(x 4 - i-j Z x ) 

( 6 . 21 ) 

Now A? + a:’* + A’ 2 + a; 2 = y 2 (a x + i ? 4 4 j 

+ A% + Af+ y 2 ^4 4 - i ~ c A X J 

= A\ + Al + A 2 + Al (6.22) 


Hence the length of a 4-vector is unchanged under rotation of axes 
(that is by a Lorentz transformation). If the square of the length 
of a 4-vector is positive it is a space-like vector, whereas if the 
square of its length is negative it is a time-like vector. 

Returning to the position 4-vector, its four components can be 
represented by (X v X 2 , Z 3 , Z 4 ) = (r, ict) . The three components 
X x , X 2 and Z 3 are the components of a vector in ordinary three- 
dimensional space; the fourth component is equal to a scalar times 
V — 1. All 4-vectors have this property. If the components of 
another position 4-vector are X x + 6X x , X 2 + dX 2 , Z 3 + <5Z 3> 
Z 4 + <5Z 4 then 

(X x 4- <5Z() = y [z x + dX x + — (A 4 + <5Z 4 ) 

X' + 6X' = Z 2 + <5Z 2 

Z' + dX 3 = Z 3 + <5Z 3 

X; + (5Z 4 = y[z 4 + <5Z 4 - i- c (X x + SX x ) 
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Subtracting the transformations for the 4-vector (X, X X X \ 
given by eqn (6.21), 11 2 ’ 3 ’ 4 


sx; = y 
tx; = y 


SXi + i- sx 4 

dX 4 ~i- dX t 
c 1 


; 8X’ = SX 2 ; dX' = dX, 


3> 


(6.23) 

Hence the increments in a 4-vector form a 4-vector. Now the 
length of a 4-vector is an invariant, hence 

dX i 2 + 6X 2 + SX? + SX' 2 = SX 2 + 8X\ + 8X1 + 8X1 
This corresponds to the relation 


8s 2 ~ 8x 2 + by 2 + 8z 2 — c 2 8t 2 = INVARIANT 

The above relation was derived in Section 3.12, where it was shown 
that the interval between two events is an invariant. It was shown 
m Section 3.12 that when 8s 2 is negative it is possible to choose an 
inertial reference frame in which the events occur at the same point. 
1 he time interval between the events is then a proper time interval. 
Let os — ic 8t, where 8r is the proper time interval. 

8s 2 = (ic 8t) 2 = 8x 2 + 8y 2 + 8z 2 - c 2 8t 2 


8x 2 

8t 2 


+ + — 2 - c 2 - (~) 2 

x s1 2 + 8t 2 c - \^r) 


(6.24) 


v the A W ° r CntS ’ „ haVin? c °-° rdinate2 *i> X 2 , X 3 and X 4 and 
1 u v 2 ^ X 2 > X 3 -f- SX 3 and X 4 -f- 8X 4 respectively, refer 

to the positions of a particle at times t and f -f- 8t in the inertial 
frame 27, then 

Sx by 8z 


8t ~ “ 


St 8t ~ “* 


Eqn (6.24) becomes 


ifi — A 


—A 


8t 2 


8r 2 = 8t 2 (l - u 2 lc 2 ) 
or 

St = 8tV 1 — u 2 (c 2 (6.25) 

In eqn (6.25), u is the three-dimensional velocity of the particle in 
the inertial frame 27. The quantity 8r is an invariant since 8s 2 is 
an invariant. If all the components of a vector are multiplied by a 
scalar or an invariant a, then one obtains a new vector of length a 
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times the original vector. Let the 4-vectors X and X + <5X refer 
to the positions of a particle at times t and t + dt; then 

<5X = {dX lt <5X 2> dX 3 , 6X t ) 

is a 4-vector. If this 4-vector is multiplied by the invariant 1/<5t, 
then one obtains a quantity which is also a 4-vector; let this be 
denoted by U. One has 


U = 


[dx 1 sx 2 sx 3 dXA 

( <5t ’ dr ’ <5r ’ dr) 

( dx dj 

V(1 - u 2 /c 2 Jdt ’ VJT 11 « 2 /c 8 )' df ’ 


dz 


icd£ 


V(1 - u 2 /c 2 ) dt ’ V(1 - u 2 /c 2 ) dt) 


But dx/dt = u x , etc, where u is the ordinary three-dimensional 
velocity of the particle, hence 


_ /_Jf*_ _^ 

\ V1 — u 2 lc 2 ’ Vl — 


u 2 jc 2 ’ Vl - u 2 jc 2 ’ Vl - u 2 !> 


u 2 lc 2 ) 


= ( _ " _ ic ) 

Vl - u 2 lc 2 ’ Vl - U 2 lc 2 ) 


(6.26) 


The 4-vector U is called the 4-vector velocity or the 4-velocity. In 
27' the 4-velocity of the particle is given by 


U' = 


u' ic ^ 

Vl - u' 2 /c 2 ’ Vl - u' 2 /c 2 ) 


(6.27) 


where u' is the three-dimensional velocity of the particle measured 
in 27'. If U is a 4-vector it must transform according to eqns (6.20), 
hence 


. V \ 





. i. DU A 

^ . v u x 7 

Id 

to 

1 

to 

1 

. - u 2 /c 2 c Vl - u 2 jc 2 _ 

1 V1 — u 2 /c 2 


But from eqn (6.27) 




i c 

V1 — u 2 jc 2 
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1 (1 ~ vuj*) 


Vl - u' 2 jc 2 Vl - v 2 jc 2 Vl - u 2 jc 2 
This is the same as eqn (4.14). From eqn (6.26) 


(6.28) 


Ui = 


U . + i- u. 


U X , . P ic 

L 1 c 4 

y 

LVl — u 2 /c 2 C Vl - a 2 /c 2 J 


l = y 


(“* — v ) 

= y ■ _ — 

Vl - a 2 /c 2 

/ 

But from eqn (6.27), £/( is equal to - 

Vl - a' 2 /c 2 

Hence, 

, __ y(« e — p) Vl — a' 2 /c 2 


Using eqn (6.28), 


Vl - a 2 /c 2 
(“* — ») 


* (1 - vujc 2 ) 

This expression is the transformation for the x component of the 
3-dimensional velocity of the particle. 

From eqns (6.20) U' 2 = C/ 2 , hence: 

V1 - u' 2 jc 2 


Vl - u' 2 /c 2 Vl - u 2 jc 2 ’ ° r “ V l - a 2 /. 2 


Using eqn (6.28), 
Similarly, 


, _ a„y 1 — v 2 jc 2 
Uy (1 — vujc 2 ) 

, u z V1 — v 2 jc 2 

Uz ~ (i - ™y 2 ) 


This completes the velocity transformations. The length of 
4-vector must be an invariant. Now 

U 2 = U 2 + t/f + C7f + U 2 

a 2 1 - 2/-2 


(1 - a 2 /c 2 ) + (1 - a 2 /c 2 ) 

« a - c 2 _ c 2 (l - a 2 /c 2 ) 


(1 - a 2 /c 2 ) (1 - „ 2 / c 2) 
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According to the postulate of the constancy of the velocity of light, 
this must be an invariant. The 4-velocity is a time like 4-vector. 

If U is the 4-velocity of a particle at the point x, y, z at a time t 
in the inertial frame 27 and if U + dU is its 4-velocity at the point 
x + dx, y -j- dy, z + dz at a time t -f- St, then the increments 
in the components of U are the components of a 4-vector, that is 
(6U V dU 2 , dU 3 , SU 4 ) is a 4-vector. Multiplying by the invariant 
1 /St one obtains another 4-vector, A, which is called the 4-vector 
acceleration or the 4-acceleration. After some algebra one finds 


A 


dU /( c 2 __ c 2 (u . a) 1 ic 3 (u . a) \ 
d r = \\(c 2 - u 2 ) a + U (e 2 - «■)■/• (c 2 - u y) 


(6.29) 


where a is the 3-dimensional acceleration of the particle. 

In the theory of special relativity the inertial rest mass of a 
particle should have the same numerical value when the particle 
is at rest in all inertial frames, though when it is moving the mass of a 
particle depends on the velocity of the particle. Multiplying the 
4-velocity by the invariant rest mass m 0 one obtains 


“ ”“ u - (vr=%p' vnSp) “ <p ' imc) (6 - 30> 


P is called the 4-vector momentum or the 4-momentum, whilst 
p = m 0 u/(l — « 2 /c 2 )* is the 3-dimensional momentum, and m = 
m 0 l( 1 — u 2 /c 2 ) 1 is the relativistic mass. At this stage one can 
proceed precisely as in Section 5.3 defining the 3-dimensional force 
f as dp/d t, and defining the work done in a displacement dl as 
f. dl. Equating the work done to the increase in kinetic energy, 
it was found in Section 5.3 that 


at e A , dT f dl f 

d T = f. dl or -r- = f. ~r = f • u 

d£ at 

Integrating, as in Section 5.3, one obtains 


T = 


m a c* 


V 1 — « 2 /c 2 


— m n c‘ 


or 


me 2 = T 4- m 0 c 2 = E 
Substituting for me 2 in eqn (6.30), 

P = (p, i Etc) 

The corresponding 4-vector in 27' is 

P' = (p', i m'c) = (p', i E'jc) 
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The length of a 4-vector is an invariant, therefore 

P 2 + (iT/c) 2 = constant 
or 

*2 E 2 

P — -^2 = constant 

When p = 0, E = m 0 c 2 so that the constant is equal to -my, so 
E 2 = /> 2 c 2 -j- 

Since the length of a 4-vector is an invariant, in Z' one has 
P' 2 ~ E'*y = 

£' 2 = / V 2 + m 2 ( -4 


giving 


To obtain the transformations for the momentum of a single particle 
use can be made of the fact that (p, iE/c) is a 4-vector. For the 
hrst component, one has, from eqn (6.20) 


K 


or 


'M*) 


P'. = y(pb + “ iElcj = y(p x - v’Ey) 
From eqn (6.20), 

Similarly ^ " F ‘ *“”» 

P 3 ; -Pa gives /»' = p z 

The relation P t = y (p 4 — — P^j gives 


iE'/c 


or 


'K - 7 #-) 


These transformations are the same as those derived in Section 5.5. 

Now from eqn (6.32) it follows that dP = (dp, i~) i s a 

4-vector. Multiplying by 1/dr one obtains the Minkowski 4- 
dimensional force F, which is defined by the relation 


F = — = **&) = ( _dp 

dr Idr’cdr/ \V (1 — u 


d E 


V(1 — u 2 /c 2 )dt ’ c V (1 — « 2 /c 2 )df/ 
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If the rest mass of the particle is an invariant, using eqn (6.30) 

dU 


F = m, 


0 dr 


(6.34) 


(The rest mass is not always invariant, for example the rest mass 
of a rocket burning up fuel and emitting gases is reducing con¬ 
tinually.) 

Now 

d E d ,, dT 

d»“5( r +"« e! ) S 


From eqn (6.31), dTjdt = f . «, where f is the 3-dimensional force, 


dp 

which is defined as f = -jj . Hence, 


F 


I f if«it ^ 

\ V1 — u 2 jc 2 ’ cV 1 — u 2 /cv 


(6.35) 


The corresponding 4-vector in S' is 

F _ (__£ _ ) 

Wl - «' 2 /c 2 ’ cV l - u’ 2 lc 2 / 

Now for the first component of a 4-vector, from eqn (6.20) one has 


(6.36) 


That is, 




f’ 

J x 


VI - u’ 2 lc 2 


= y 


fx 


LVi - u 2 lc 2 


»i f. u 

c c V 1 — u 2 lc 2 . 


f’ = 

J x 


Vl — u' 2 jc 2 


fx - +/*«, + /,«,) (6-37) 


Vl — v 2 lc 2 V 1 — u 2 jc 2 
But from eqn (6.28), 

Vl - u' 2 lc 2 _ 1 

Vl - 4Vl -u 2 \c 2 

Eqn (6.37) becomes 


0 - 5 ) 
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that is 


fm =fx ~ 
Since, from eqn (6.20), F' 


VU Jy 

(C 2 - VU X ) 


f 'Uzfz 

(r 2 — vu x ) 


(6.38) 


f v _ fv 

Vl — u' 2 jc 2 ~ Vi - uVc 2 

Using eqn (6.28), 

/ l — u 2 j c 2 _ / 1 - v 2 jc 2 

V 1 - u 2 lc 2 V (1 - vujc 2 ) 2 

Hence, 


Similarly, 


cW 1 - r 2 /c 2 
h (<?-vu x ) f » 

cW{\ - v 2 /c 2 ) , 
( C 2 - 


(6.39) 

(6.40) 


This completes the transformations for the three-dimensional force. 
The use of 4-vectors is a concise alternative approach to the rela¬ 
tivistic mechanics of a single particle, the equation of motion being 
given by eqn (6.34). If it is postulated that in point collisions or 
particle decays 

2 Pj = constant = Z P ( (6.41) 

where P s and P, are the 4-momenta of the ith particle, before and 
after collision, then writing eqn (6.41) into components, 


^fix Px> ZP v — £P v l £p z — Ep z \S— = 27—- or ZE = EE 

c c 

Thus the law of conservation of linear momentum and the law of 
conservation of mass-energy are both included in eqn (6.41). 

Some people prefer to solve problems in relativistic mechanics, 
by writing all the equations down as 4-vector equations. One can 
extend the theorems of vector and tensor analysis to four dimensions, 
and these techniques often simplify the working out of problems! 
For these techniques as applied to electromagnetism see Rosser 1 , 
Chapter 10. A reader interested in the development of the theory 
of special relativity using 4-vector methods is referred to Relativity, 
The Special Theory, by J. L. Synge, and to Muller’s The Theory of 
Relativity. 

In this section the imaginary fourth component was used. The 
theory developed so far could have been developed, precisely as 
above, if we had defined a 4-vector to be a quantity having four 
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real components x, y, z, m = ct which transform according to the 
equations 

v \ / 

*— -my, y =y\ z! = z; m' = y(m ——I (6.42) 

The proper time dr is again invariant. The full development of the 
theory of special relativity in terms of 4-vectors using real variables 
only, and its extension to the theory of general relativity, requires 
the use of the notation and the results of differential geometry. 
This requires an extensive mathematical preparation beyond the 
scope of the present work. The interested reader is referred to such 
books as the one by Panofsky and Phillips. The theory of special 
relativity can be extended to electromagnetism, using X 4 = i ct as 
the fourth component of the 4-vectors and it is shown by Rosser 1 , 
Chapter 10 that ordinary Cartesian tensors, which can be developed 
simply from matrix theory, suffice in this case. 

6.5. DE BROGLIE ‘WAVES’ 

According to eqn (6.32) the three-dimensional momentum p and 
the energy E of a particle can be combined to form the 4-vector 
(p x , p v , p x , iEjc) = (p, iEjc) . Under certain conditions a photon 
behaves like a particle, and it was illustrated in Section 5.8.3, 
how a momentum p = n hvjc can be associated with a photon of 
energy E = hv, where v is the frequency, h is Planck’s constant and 
n is a unit vector in the direction of motion of the photon. In the 

case of a photon, the 4-vector (p, iEjc) can be rewritten as 

ihv\ . . . 

— I. Now consider a single photon moving in the xy plane of the 

inertial frame 27, and in the x'y' plane of 27', such that the direction 
of motion of the photon makes angles of 6 and 8' with the x and x' 
axes of 27 and 27' respectively. From eqn (6.20), for a 4-vector A 

a 'i~y(a 1 + ^ 4 ); A’ = A 2 ; 

Applying these equations to the photon 4-momentum, one obtains 
v' cos 6' = yv{ cos 6 — vjc) 
v' sin 8' = v sin 8 

v' = yv{ 1 — (vjc ) cos 8) 

These are precisely the same as eqns (4.28), (4.29) and (4.30) 
respectively, which were derived in Section 4.4 when the trans¬ 
formation of plane waves in vacuo was considered. This illustrates 
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how the photon and wave models yield precisely the same results 
for the Doppler effect and aberration. It also shows that the 
direction of propagation and frequency of a plane wave combine 
to form the 4-vector (nr, iv), where n is a unit vector in the direction 
of propagation of the plane wave, and v is the frequency of the 

wave. Multiplying by hjc, one obtains the 4-vector 

where h is Planck’s constant. Thus, in the case of light the mo¬ 
mentum and energy of a photon can be combined to form the 
4-vector (p, i E/c), and the wavelength and frequency of a plane 
monochromatic wave can be combined to form the 4-vector 



( nh iAiA 

T’T/‘ 


De Broglie suggested that it might be possible to associate 
‘waves’ with particles of finite rest mass. If this were so, by analogy 
with light, one would expect the momentum and energy of the 
particle to combine to form the 4-vector (p, iE/c), whilst the 
wavelength and frequency of the particle ‘waves’ might be expected 


to combine to form the 4-vector 


/nh ihv\ 

\T’ T/’ 


where n is a unit 


vector in the direction of propagation of the particle ‘waves’. De 
Broglie suggested that Planck’s relation E = hv might be valid for 
particles, in which case the ‘wave’ 4-vector can be re-written as 

(~, Comparing with the 4-vector (p, iE/c), one would then 

expect that p should be equal to n hjX, or 


A = h/p (6.43) 

This is the de Broglie relation; it has been confirmed by experiment. 
It would be beyond the scope of the present text to discuss the nature 
of de Broglie ‘waves’ and the extension of the theory to quantum 
mechanics. The interested reader is referred to a textbook on 
quantum mechanics. The de Broglie relation was not deduced 
from the theory of special relativity in the sense that it follows 
logically from the principle of relativity and the principle of the 
constancy of the velocity of light. What has been shown is that the 
relation A = A/p is consistent with the theory of special relativity. 
The phase velocity v ph of de Broglie waves is given by 

h E E m 0 c 2 m 0 u c 2 

vh P k P 'V 1 — « 2 /c 2 V I — u 2 /c 2 u 
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Hence, the phase velocity of particle ‘waves’ exceeds the velocity 
of light. On the other hand, the group velocity v g is given by 


dr d [hv) 

d E 


v >~ ( 
V 


dp 


d 

m 0 c 2 

d 

m 0 u 

da 

L(1 - a 2 /c 2 )*J 

da 

L(1 - a 2 /c 2 ) * J 


m 0 u 

m 0 



(1 - a 2 /c 2 )* ' (1 - a 2 /c 2 )* 


Hence, the group velocity of de Broglie waves is equal to the 
velocity of the particle. 


REFERENCES 

1 Rosser, W. G. V. An Introduction to the Theory of Relativity. 1964. London; 
Butterworths 


recommended reading 


Minkowski’s original article is translated in The Principle of Relativity, a 
collection of original papers published by Dover Publications Inc. Good 
accounts of the geometrical representation of the Lorentz transformations 
will be found in Bom’s Einstein's Theory of Relativity, French’s Principles of 
Modem Physics, Eddington’s Space, Time and Gravitation, and Ahoroni’s The 
Special Theory of Relativity. For accounts of a simple geometrical method 
of representing the Lorentz transformations between two inertial reference 
frames only, the reader is referred to R. W. Brehme ( Amer. J. Phys. 30 (1962) 
489, 31 (1963) 517) and to F. W. Sears {Amer. J. Phys. 31 (1963) 269). 

For the development of the theory of special relativity using 4-vecto:rs, 
the reader is referred to M0ller’s The Theory of Relativity, to Classical Electricity 
and Magnetism by Panofsky and Phillips and to Relativity, The Special Theory 
by J. L. Synge. 

PROBLEMS 

Note : In all the calculations on 4-vectors, the reader should assume that 
the 4-vectors are defined acording to eqn (6.20). This is consistent with 
M0ller’s The Theory of Relativity. 

Problem 6.1 —Show that x' 2 + y' 2 + z' 2 — c 2 !' 2 is transformed into 
x 2 4- y 2 + z 2 — c 2 t 2 by the transformation 


x' — x cosh a — ct sinh a; y' —y; z' = z; 
ct' = —x sinh a + ct cosh a. 


Show that if a = tanh -1 vie = \ log e — —- 

{c -v) 

axe the same as the Lorentz transformations. 


the above transformations 


216 


Pufixz. TfatLLiijcjti.tiAjxz.tLL ~PlLy.Al.tzA. 




PROBLEMS 


Problem 6.2 —Show that x' 2 + y' 2 + z' 2 — A' 2 is transformed into 
x 2 + y 2 + z 2 — c 2 t 2 by the transformation 

x' = x cos <f> + id sin <j> 

f =r, z' = z; 

id' — —x sin <j> + ie< cos (j> 

Show that, if tan <j> = i vjc, the above transformations are the same as the 
Lorentz transformations. 

Problem 6.3 —Explain the use of the Minkowski diagram using x and 
m — d as variables. What is the significance of world lines ? What are the 
calibration hyperbolae ? 

Discuss in terms of a Minkowski diagram the concepts of (a) length 
contraction; ( b ) time dilation. Use geometrical methods to calculate the 
formulae for length contraction and time dilation. 

Problem 6.4 —Explain how the Lorentz transformations can be represented 
as a rotation in the X x — x, X 2 = y, X 3 = z, X x = icZ space. 

Discuss the phenomena of length contraction and time dilation in terms 
of a space-time diagram using X x and X x as variables. 

Problem 6.5 —Show that the length of a 4-vector defined by eqn (6.20) is 
an invariant. What is the square of the length of (a) the 4-velocity; ( b ) the 
4-momentum? 

Problem 6.6— In Section 3.12 an interval ds was defined to be a time-like 
(space-like) interval if d* 2 + dy 2 + dz 2 — c 2 dl 2 was negative (positive). 
Show that the 4-velocity and 4-momentum for a single particle are time-like 
vectors, and show that the 4-acceleration and the 4-force for a single particle 
are space-like vectors. [Comment: It is instructive to consider the 4-vectors 
in the reference frame in which the particle is instantaneously at rest. In 
this reference frame, the fourth (or time-like) component of the 4-acceleration 
and 4-force are both zero, since both are proportional to the 3-velocity of the 
particle. This shows that these 4-vectors are space-like. It is always possible 
to choose a reference frame in which the fourth component of a space-like 
vector is zero. Conversely, it is possible to find a reference frame in which 
the first three components of a time-like vector are all zero.] 

Problem 6.7— -The scalar product of two 4-vectors can be defined by the 
relation 

A . B = A X B X + A 2 B 2 + A 3 B S + A X B X 

Show that the scalar product, so defined, is an invariant, that is, show 

A'. B' = A . B. 

Problem 6.8 —Show that the 4-velocity and the 4-acceleration of a particle 
are orthogonal, that is, show that U . A is zero. [Hint: Use direct sub¬ 
stitution, or start from the fact that U. U = — c 2 and differentiate with 
. . TT dU 

respect to t giving U . — = 
dr 

Problem 6.9 —The Minkowski 4-force on a particle is defined by the 

, . „ dP 

relation F = —— . 

dr 
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Show that if the rest mass of the particle is a constant, then F. U =0, 
whereas if the rest mass is not constant F . U = —c 2 ——^ [Hint: F = 

, , j»y dr 

— (m 0 U) = U -^2 + m Q — . Form F . U and remember U . U = — c 2 ]. 
dr dr dr 

Problem 6.10 —Show that if r(t) is the position vector of a moving point 
in a co-ordinate system 2, then dr = (di 2 — dr 2 /c 2 j * is an invariant for 
all inertial frames 2 in uniform relative motion, and deduce that the 
Minkowski velocity and acceleration are, respectively, ( yu , iye) and 
(y 2 (a + u0), i cy 2 6), where u and a are the Newtonian 3-velocity and 
3-acceleration, y = (1 — tpjtp)~i and 6 — y 2 u . a/c 2 . 

Hence, or otherwise, show that if aQ is the acceleration in the rest frame 
(in which the velocity is zero instantaneously) the corresponding acceleration 
a in the frame in which the particle has velocity u is given by 

a = ^ j a o - (« • a o) u j ( Exeter 1961 ) 

Problem 6.11 —A particle of rest mass m 0 describes the parabolic trajectory 
x — ut,y — %at 2 , z = 0 in the inertial frame 2. Find the four components 
in the -F-system of the Minkowski 4-force vector, and the corresponding 
Newtonian force on the particle. [Hint: Use eqn (6.35). As a check for the 
Minkowski 4-force F, we have F = m 0 A, where A is the 4-acceleration. 
Comparing with Problem 6.10, one can write F = (m o y 2 u0, m 0 y 2 (a + atO), 
0, i m 0 cy 2 6) where y 2 = 1/(1 — u 2 /c 2 — fl 2 l 2 /c 2 ); 0 = y 2 a 2 //c 2 .] 

Problem 6.12— Define the Minkowski force (G, i H) on a particle, and 
show that 

G = f 0 - (1 - y) (v . f 0 )v/r 2 ; H = y(v . t 0 )jc 

where v is the velocity of the particle in the 2 frame, f 0 is the Newtonian 
force in the rest frame and y = (1 — t> 2 /c 2 )~h 

Deduce that in rectilinear motion, the Newtonian force in the 2 frame 
(G/y) is independent of v. 

A particle of rest mass m 0 is acted on by a constant force m 0 f along the 
axis Ox and is at rest at 0 at time t = 0. Find the distance it will travel 
in time f, and show that its momentum when it has travelled a distance 
x is m 0 (2xf + x^j^lc 2 ) 1. (Exeter 1961). 

Problem 6.13 —Show that the de Broglie wavelength associated with a 
particle of rest mass m 0 moving with velocity u is, 

A = h( 1 — M 2 /c 2 )f/m 0 u 

Problem 6.14 —An electron is accelerated from rest through a potential 
difference of V volts. Show that its de Broglie wavelength expressed in 
Angstroms is 



Problem 6.15 —In Chapters 3 and 6 continuous world lines were used on 
space time diagrams to represent the motions of particles. According to 
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the uncertainty principle the positions of atomic particles are generally 
indeterminate and their precise positions unknown. How do you reconcile 
these viewpoints? (Comment: According to quantum mechanics the 
precise position of a particle is only known at the world points where its 
position is determined by experiment. Between these world points the 
position of the particle is indeterminate. However, the indeterminate 
behaviour between known world points is not inconsistent with special 
relativity, provided the wave equation used to describe the system satisfies 
the principle of relativity, when the co-ordinates and time are changed 
according to the Lorentz transformations. This is true of Dirac’s equation 
for an electron.) 
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RELATIVISTIC ELECTROMAGNETISM 


7.1. INTRODUCTION 

When students are taught electromagnetism for the first time, it is 
inevitable that the individual laws such as Coulomb’s law, Ampere’s 
circuital law and Faraday’s law of electromagnetic induction are 
introduced separately. As a result many students tend to think of 
these laws as describing completely independent phenomena, and 
very few are aware of even the relative orders of magnitude of the 
electric and magnetic forces between moving charges, let alone the 
intimate connection between the electric and magnetic forces 
between moving charges. 

In Section 7.2 the ratio of the electric to the magnetic forces in a 
simple case will be calculated using formulae familiar to most 
advanced students at school. The example will then be discussed 
taking Coulomb’s law and the transformations of the theory of 
special relativity as axiomatic. This will illustrate the unity of 
electromagnetism in a vivid way. 

It was illustrated in Chapter 2 how the theory of special relativity 
evolved from optics and classical electromagnetism. In the theory 
of special relativity, Einstein extended the principle of relativity to 
optics and electromagnetism. For the benefit of readers familiar 
with Maxwell’s equations, it will be shown, in outline, in Section 
7.3 that Maxwell’s equations do obey the principle of relativity, 
if the co-ordinates and time are transformed according to the 
Lorentz transformations. This Section can be omitted, as the 
results are not used elsewhere. A reader interested in a more 
comprehensive discussion of relativistic electromagnetism is referred 
to Rosser 1 , Chapters 7, 8, 9 and 10 and to Rosser 2 . 

7.2. FORCES BETWEEN TWO PARALLEL CONVECTION 
CURRENTS 

Consider two infinitely long, straight, thin, uniformly charged, 
non-conducting wires in vacuo lying in the xy plane and moving in 
the positive x direction with uniform velocity v relative to an inertial 
frame 2 as shown in Figure 7.1(b). Let A be the electric charge in 
coulombs per metre length of the wires, measured relative to 2. 
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Figure 7.1. The electric and magnetic forces between two convection 
currents, (a) The charge distributions are at rest in 2': (b) the 
inertial frame 2 moves with a velocity —v relative to 2'. In 2 
the charges move with velocity v 


As the wires are moving relative to 27, the charge distributions give 
rise to convection currents of magnitude Av amperes respectively. 

Application of Gauss’ law and Ampere’s circuital law gives the 
electric and magnetic fields at a distance r from the wire P. 


Ey 2t rs 0 r 


and 


■ _ _ Poho 

‘ 2-nr 2-rrr 


(7.2) 


The electric field gives rise to a repulsion in the y direction of 
X z j2vs 0 r N/m length of the wire Q. The magnetic field gives rise 
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to an attraction of N/m, which is equal to /u 0 /. 2 v 2 /2Trr. The 

resultant force on the wire Q per metre length is given by 

f v =Je lee + Jinag 

A 2 flpPv 2 

— 2ire 0 r 2-nr 


A 2 

2tte 0 t 


(1 - / i 0 e 0 v 2 ) 


Numerical substitution of 8-85 X 10 -12 for e 0 and 4ir X 10~ 7 for 
/u 0 shows that fi 0 s 0 is equal to 1 jc 2 , where c is the velocity of light. 
Hence, 


/v = 


A 2 

2iT£ 0 r 


(1 - y 2 /c 2 ) 


(7.3) 


In this simple case the ratio of the magnetic force to the electric 
force is » 2 /c 2 , where v is the velocity of the charges relative to the 
inertial frame of reference chosen. 

In a typical case, say a current of 1 A flowing in a copper wire 
of cross-sectional area 1 sq. mm, the average drift velocity of the 
electrons is /-^lO -4 m/sec. If v is the mean drift velocity of electrons 
the current is navq, where n is the number of charge carriers per unit 
volume, a is the area of cross-section and q is the charge on each car¬ 
rier. Assuming that in copper n ~ 8-5 X 10 28 free electrons/m 3 , and 
using q = 1-6 X 10~ 19 coulombs one finds v ~ 7-3 X 10“ 5 m/sec. 
Hence the ratio of the average magnetic forces between the moving 
electrons in such a current to the electric forces between them is 
~d0 -25 . However, in a copper conductor there is no resultant 
electric charge as the charges on the moving electrons are com¬ 
pensated by the positively charged copper ions, which are virtually 
at rest in a stationary conductor. For an electrically neutral 
conductor the electric forces on the moving electrons is com¬ 
pensated by the electric force on the positive ions, and the resultant 
force on the conductor is due to the relatively small magnetic 
forces between the moving charges. 

The above example will now be considered from the viewpoint 
of the theory of special relativity. According to this theory, the 
laws of electromagnetism should have the same mathematical form 
in all inertial frames of reference. Consider the inertial frame Z' 
moving with uniform velocity v relative to Z along the positive x 
axis. In this reference frame the wires and the electric charge 
distributions are at rest as shown in Figure 7.1(a). Let the electro¬ 
static charge distribution in Z' be measured to be A' coulomb/metre 
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and let A' be made up of n discrete charges per metre of magnitude 
q coulombs each. In S' Maxwell’s equations reduce to Coulomb’s 
law of force between electrostatic charges. At the position of the 
wire Q the electric and magnetic fields due to the wire P are given 

uy 

and B ' = 0 (7 - 4) 
The force on one of the charges on the wire Q is equal to 


/; = o; /; = 


Jq_. 

2iTe 0 r’ 


/; = o 


(7.5) 


It will now be assumed that the transformation formulae derived in 
Sections 5.5 and 6.4 for the force acting on a single particle are 
correct. From eqns (5.77), (5.78), (5.79), when u' = 0, 


fx fx (7.6) 

fv = V(T - I m (7.7) 

fz = - v 2 jc 2 )f; (7.8) 

Substitution from eqns (7.5) into eqns (7.6), (7.7) and (7.8) gives 

fx — fz — 0 


(7.9) 

The value of the charge q is assumed to be invariant under Lorentz 
transformation. We have omitted the transformation of time from 
S to S, namely 

/ _ *' + w'/c* 

_ Vi - » 2 /c 2 

If the forces are taken to be simultaneous for all x' in S', they are 
not simultaneous for all x in S; but we are assuming steady con¬ 
ditions independent of time. The length r is unchanged under 
Lorentz transformation, as it is measured in the y direction. 

Since the wires are moving in the inertial frame S, a length of 
wire measured to be l 0 when the wires are at rest in S', is reduced to 
z o(! — flc 2 )* when measured in S. Hence the number of charges 
per unit length measured in S is greater than in S', as illustrated in 
Figures 7.1 {a) and ( b). The number of charges per unit length in S 
is given by the equation 

” ~ Vi - v 2 j<? ( 7 ‘ 10 ) 

223 


~Pufejc. TVtyixej. 



RELATIVISTIC ELECTROMAGNETISM 


Hence 


A = nq = 


A' 


V 1 — v 2 /c 2 V 1 — b 2 /c 2 


(7.11) 


The force/metre on the wire Q, measured in 27, is equal to the 
number of charges per metre length, measured in 27, times the force 
on each charge, that is using eqn (7.9) 


= ~— V1 — v 2 lc 2 since nq — X 
2tte 0 t 

Substituting for X' from eqn (7.11), one finds that the force/metre 
length of Q measured in 27 is equal to 


A 2 

2tte q t 


(1 - v 2 jc 2 ) 


This is in agreement with eqn (7.3). 

This example illustrates how the magnetic forces produced by 
currents can be calculated in some cases from Coulomb’s law for 
the forces between electrostatic charges, if the transformation 
formulae for force given by the theory of special relativity and the 
principle of constant charge are taken as axiomatic. The magnetic 
forces can be interpreted as relativistic effects and represent those 
parts of the transformed forces which depend on the velocity of the 
test charge relative to the observer. For a full discussion of this 
approach the reader is referred to Rosser 2 . 


7.3. RELATIVISTIC INVARIANCE OF MAXWELL’S 
EQUATIONS 

7.3.1. Introduction 


The macroscopic electric and magnetic fields, both in empty space 
and inside stationary material bodies, are adequtely described by 
Maxwell’s equations, which in an inertial frame E', in which the 
materials are at rest, take the form 


curl ' E ' = 

(7.12) 

curt- H- - + J' 

(7.13) 

div' D' = p' 

(7.14) 

div' B' = 0 

(7.15) 
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In eqns (7.12)-(7.I5), E' is the electric intensity, D' the electric 
displacement, B the magnetic induction, H' the magnetizing force, 
p the true macroscopic charge density and J' the true macroscopic 

tHe qUantities a PP ear ing in eqns 
(7.12), (7.13), (7.14) and (7.15) apply to the same point z ' at a 
timet in the inertial frame 27'. All the differential coefficients in 
thei equations “* ^ rCSpeCt t0 y ’’ z ' or t> > for example, eqn 

3*' + 3/ + 3z' _ 0 

h he - P ° S i! UlateS i. 0f the the0ry of Special relat ivity is that the 
laws of physics have the same mathematical form in all inertial 

the^W T 7 nC r • If Ma u xwell ’ s equations are correct and obey 
the principle of relativity, then in the inertial frame 27 moving with 
velocity v in the negative Ox' direction relative to 27', one should 

curl E = — (7.16) 


3B 

dt 


, „ 3D 

curl H = —- 4- T 
dt 

div D = p 
div B = 0 


(7.17) 

(7.18) 

T . . , (7-19) 

lull ®9 uatlons should hold for a point *, ;,zata time , in 27, 
where x,y,z and i are related to *',/, z' and *' by the Lorentz 
transformations. It will be assumed throughout Section 7 3 that 27' 
moves with uniform velocity » relative to 27 along their common * 
axis, so that v and y = (1 — d 2 /c 2 )-* are constants. 

7.3.2. Transformation ofdjdx, djdy, d/dz and d/dt 

Consider a function F which is a function of *',/ z ' and t' in 27' 
The total differential of F is ’ 


dF a / i dF j , dF dF 

*' dx + dy dj> + H? dz + 54 dt' 


^ ~ dx' ax I- df w + 3? d2 ' + ap dt ' (7.20) 

Now for a given event, and f are all functions of *, y, z and 

t. The total differential of x can be expressed as 


, , 9*' , dx ' dx' dx’ 

~^ dx + W dj + dI dz + li dt 


dx ” 1 dy^^Tz 
According to the Lorentz transformations 


(7.21) 


x'=y(x-vt); y =y; z' = z 
t' = y(t - vxjc 2 ); y = (1 _ v y c 2yi 
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Remembering that v and y are constants, we have 
dx' dx' dx’ _ n 3x _ 

3x _ y; dv ~~ ° 5 dz ' ’ dt 


dx “ By ~ 3z 
Substituting in eqn (7.21) 

dx’ = y dx — yv dt 

Similarly, 


(7.22) 

dy' — dy (7.23) 

dz' = dz (7.24) 

at' dt' dt’ , dt' , 

d< ' = ^ d * + F dj + ^ 

= — ^ dx + y dt (7.25) 

c 2 

Substituting for cU', dy', dz' and dt' from eqns (7.22), (7.23), (7.24) 
and (7.25) respectively into eqn (7.20), we have 

dF dF , dF , dF/ 7 V , \ 

dF =(y dx - yv dt) + ^ dy + gp dz + gp(y dt - j dxj 

Rearranging, 

‘“'=(’’ 1 ?-?w) dx + W dj ’ + % Az 

+ {y d 4~y4) 61 < 7 - 26 > 

But, if.F is a function of x,_y, z and t, the total differential of F can be 
written as 

+ + + (7 - 27) 

Comparing the coefficients of dx, dy, dz and dt in eqns (7.26) and 
(7.27) we obtain 


a /a v a\ 

d^c = Y [d?'~?dFj 


_a = _a 

dy dy' 

d_ __a 

3z 3z' 

Wi 
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7.3.3. Transformations for E and B 
The y component of eqn (7.16) is 

dz dx dt 

aaffreipfcttely!^ ^ “ d ^ ^ eqnS (7 ' 30)) (7 ' 28 ) and 
52 y U*' c 2 3*7 _s 0?) 


dx' y ( E ‘ + vB J = 


d v 

“h ^2 4) 


(7.32) 


Rearranging, 

dE_ x 

dz' dx' ' v “* 1 - g? 

If Maxwell’s equations are to be Lorentz covariant, that is invariant 
in mathematical form in all inertial frames of reference, then in S' 
one must have „ . 

dz' dx’ dt' ( 7 - 33 ) 

Eqns (7.32) and (7.33) have the same mathematical form showing 
that thejx-component of eqn (7.16) is Lorentz covariant. In fact if 
one puts f 

K = y{E, + vB y ) 


K = y(b v + - 


then eqns (7.32) and (7.33) are exactly the same. 

It is left as an exercise for the reader to show that the x and z 
components of eqn (7.16) transform into the *' and z' components 

,°n^ qn {7 -}n\^rT^ ™ nd t0 Sh ° w that ec 5 n ( 7 -!9) transforms 
into eqn (7.15), if E and B transform according to the following 
equations (Reference: Rosser 1 , Section 8.2): S 


K 


K = y( E v — vB z ) 
K = y{E, + vb v ) 

b'=b„ 


K = y[s v 

5 * = r(^ 


+ 


5*) 

-? E ) 


E X = E’ X 

E v = y(E’ v + vB' t ) 

E. = y(E' z - vB' v ) 

E»=K 

By 

= )-(*:+3 s;)J 


B, 


(7.34) 


(7.35) 
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These transformations relate the numerical values of the electric 
and magnetic fields at a point x,y, z at a time t in E with the values 
of the electric and magnetic fields at a point y', z at a time t' 
in E', where x, y, z and t are related to x', y', z! and t' by the 
Lorentz transformations. For a full account of the applications of 
eqns (7.34) and (7.35) the reader is referred to Rosser 1 , Chapter 8. 

7.3.4. Transformation of Charge and Current Densities 

We shall start with a simplified model and consider a ‘uniform’ 
electric charge distribution of volume V in the inertial frame E, 
consisting of n discrete charges per cubic metre, of magnitude q 
coulombs each. It will be assumed that all the charges have the 
same velocity u relative to the inertial frame E. For this simplified 
case the charge density is 

p — nq (7.36) 

The current density, which is the current crossing unit area normal 
to the direction of current flow, is given by 

J = nqu. (7.37) 

or in components, J x = nqu x ; J y = nqu y ; J x = nqu z . 

Let the same charge distribution have a volume V when measured 
relative to an inertial frame E' which is moving with uniform velocity 
v relative to E, and let it consist of n' discrete charges/m 3 moving 
with velocity u' relative to E'. One then has 

p' = n'q (7.38) 

J' = n'qv! (7.39) 

The principle of the invariance of total electric charge will be taken 
as axiomatic in the present section. According to this principle, 
the total charge on a body is independent of the velocity of the body, 
and the total charge has the same numerical value in all inertial 
reference frames. In Section 5.4, it was assumed that the magnitude 
of a moving charge in the expression for the Lorentz force was 
independent of the velocity of the charge. It was shown that this 
assumption was in agreement with experiments. Hence, the same 
value for q is used in eqns (7.36), (7.37), (7.38) and (7.39). 

Consider an inertial frame E° in which the charge distribution is 
at rest; E° moves with velocity — u relative to E. Let V 0 be the 
proper volume of the charge distribution in E°. Let n 0 be the 
number of charges/m 3 measured in E°, such that the total number 
of charges is equal to n 0 V 0 . Due to length contraction, the volume 
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of the charge distribution in 27 should be measured to be 

F 0 (l - 

The total number of charges measured in 27 is equal to nV, that is, 
n V 0 (\ — « 2 /e 2 )*. But the total number of charges is an invariant, 
since it is a pure number. Hence, 

nV = nV 0 V (1 - a 2 /c 2 ) = n 0 V 0 
or 


Similarly in 27, 


V{1 - a 2 /c 2 ) 


so that 


From eqn (4.14), 


V(l - «' 2 /c 2 ) 

/ (1 ~ « 2 /c 2 ) 


(1 - «' 2 /f 2 ) 


/(l-^ 2 ) (1 - vuj c 2 ) 

V (1 - «' 2 /c 2 ) V (1 - » 2 /c 2 ) 

Substituting in eqn (7.40), one obtains 
, n(l — vujc 2 ) 

Multiplying by q. 

Substituting from eqns (7.38), (7.36) and (7.37), 

p “ y(p ~ ^r) 

Now from eqn (7.39), 

r/ / f 

•4 = 

But from the velocity transformations, one has 

“*=(“* — »)/(l — vujc 2 ) 
Substituting for a^, and for n' from eqn (7.41), 

= (i _ Hujc*j n y( l - vu *i ci ) 


(7.40) 
(4.14) 

(7.41) 
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Substituting from eqns (7.39), (7.36) and (7.37) 

J 'x = Yi J x — vp) 

Similarly, from eqn (7.39) 

J l II 

v — 

From the velocity transformations, 

, u v V (1 — v 2 jc 2 ) 

Uy ~ (1 -vujc*) 

Substituting for u' v and 

V(1 — v 2 lc 2 ) 

J , = <1 U V _ V ujc 2 ) ny{l ~ = Jy 

Similarly 

J'x = J, 

Collecting the transformations, 

J'x = y(-4 - «p) -4 = y{J'x + V) 

= J, Jy= Jy (7.42) 

4 = 4 4=4 



It is left as an exercise for the reader to extend the treatment to 
systems containing both positive and negative charges and to show 
that eqns (7.42) are valid in the general case. (Reference: Rosser 1 , 
Section 8.4.) 

7.3.5. The Transformations for D and H 

Consider the y component of the equation 

9D 

curl H = J + -zjj (7.17) 

that is 

dH x dH z dDy 

dz dx dt 

Substituting for d/dz, djdx and d/dt from eqns (7.30), (7.28) and 
(7.31) and for J v from eqn (7.42), 
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or 


dH m d 

17-dprM-” 0 ’* 






(7.43) 


If Maxwell’s equations were valid in E', one would have 

dHL _ 3JK _ « , 

dz’ dx' v ^ ■ dt' 


(7.44) 


Eqns (7.43) and (7.44) have the same mathematical form, and if one 
puts 

K = H.i K = y(H z - vD y ); JD; = y (d v - H^j 
then they are exacdy the same. 

It is left as an exercise for the reader to show by similar methods 
that the x and z components of eqn (7.17) transform into the x' 
and z' components of eqn (7.13) and that eqn (7.18) transforms 
into eqn (7.14), if J and p are transformed using eqns (7.42), and 
if H and D satisfy the transformations (Reference: Rosser 1 , Section 
8.5): 

K = D X 

D 'v = y(^v 

K = y(H y + vD z ) 

K = y{H z - vD v ) 

It has been shown that if Maxwell’s equations are valid in E, 
then if the co-ordinates and time are transformed using the Lorentz 
transformations, and if one takes the principle of constant charge as 
axiomatic, then the transformed equations have the same mathe¬ 
matical form as Maxwell’s equations would have if they were valid 
in E'. This is true whether there are material bodies present at the 
point or not. Thus Maxwell’s equations satisfy the principle of 
relativity when the co-ordinates and time are transformed according 
to the Lorentz transformations. 

Only the briefest insight into the relativistic invariance of 
Maxwell’s equations has been given. For a more comprehensive 
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D X = D' X 


D v = y[D' v + v 7i H^ 
D.=y(D' z -j 2 H'^ 

H X = K 

H v = y(K - vD' z ) 

H, = y{K + vD’ v ) 


(7.45) 
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account of relativistic electromagnetism, including an account of 
the applications of the transformations, the reader is referred to 
Rosser 1 , Chapters 7, 8, 9 and 10 and to Rosser 2 . 
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PROBLEMS 

Problem 7.1 —Starting from Maxwell’s equations, derive the formulae 
for the transformation of the electric and magnetic field vectors E and B 
from one inertial reference frame to another moving with uniform velocity v 
relative to the first. Hence, show that the electric field at a point P in 
empty space due to a charge of magnitude q, at O moving with uniform 
velocity v is given by 

E = (1 — /3 3 ) (1 - P 2 sin 2 0)-*, ft = vie 

T7 TGqT** 

where r is a vector from O to P and 0 is the angle between r and v. Show 
that the magnetic field at P is given by 

B = v x E/c 2 

Problem 7.2 —A charge of magnitude q coulombs is moving with uniform 
velocity u in the inertial reference frame 27. In the inertial frame 27' moving 
with velocity u, relative to E, the charge q is at rest, and the force on it is 
?E'- Use the force transformations and the transformations for E and B to 
show that the force on the particle in E is qE + ?u x B. 

Problem 7.3 —Show by direct substitution that the following quantities 
are invariants under a Lorentz transformation: (a) E 2 — c 2 B 2 ; (b ) B . E; 
(c) H 2 - c 2 !) 2 ; (rf)H.D; (e)B.H-E.D; (/) eB . D + (1/e) E . H. 
[Hint: Show that B' . E' = B . E etc. by using the field transformations. ] 
Problem 7.4 —The electric field of an infinite line charge of magnitude 
V C/m at rest along the x' axis of the system E' is given by E x = B x — 

o; 

F ' £/ . F > *'z' 

v 2 we 0 (y* + z' 8 ) ’ z 2 rre 0 (y'* + z'*) 

Use this result to determine the magnetic field of a linear current i along the 
x axis of the 27-system. What is the electric field in this instance ? Compare 
the magnitudes of the electric and magnetic forces on a test charge q moving 
with velocity u at the point x, y, z. 
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8.1. INTRODUCTION 

The clock paradox concerns the readings of an accelerated clock. 
Ever since the introduction of the theory of special relativity, it has 
been the subject of great controversy. For the reasons elaborated 
at the end of this Section, in the author’s opinion, in themselves 
without further assumptions the Lorentz transformations do not say 
what the readings of an accelerated clock should be. The approach 
adopted from Section 8.2 onwards will be similar to the approach 
adopted in Chapter 5, when the variation of mass with velocity was 
developed. In Chapter 5 the theory of special relativity was used 
as a heuristic aid to ‘discover’ the law describing the variation of 
mass with velocity. This law was then compared with experiment. 
In this Section, the clock paradox will first be introduced in the 
conventional way in terms of the Lorentz transformations, so as to 
give the reader an introduction to the historical background to the 
controversy, before proceeding to the analysis of the problem in 
Sections 8.2, 8.3 and 8.4. 

Consider two clocks A and B. Let clock B be at rest in a laboratory 
on the earth and let it be at the origin of an inertial frame E. 
Effects due to the rotation of the earth will be neglected. Let the 
other clock, clock A, be at rest in a rocket which is moving with 
uniform velocity v relative to the earth. It will be assumed that at 
this stage of the journey the rocket is at rest at the origin of an 
inertial frame E'. Let clock B have the same properties relative to 
the laboratory as clock A has relative to the rocket. Both clocks 
can be calibrated in terms of some atomic frequency. Let the 
rocket leave the earth at t — t' = 0 when the origins of E and E' 
coincide, and let clocks A and B be synchronized at this instant. 
Let the rocket travel with uniform velocity v until it reaches a clock 
G in outer space, which is at rest relative to the earth as shown in 
Figure 8.1(a). Let clock C be synchronized with clock B which is at 
rest on the earth, using light signals as specified by Einstein (cf. 
Section 3.8). Let the clocks B and C read (t £)bc an d let clock 
A on the rocket read (t R ) bc when the rocket coincides with clock C. 
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X( Inertial frame 
Laboratory frame) 




The rocket moves with uniform 
velocity until it reaches clock C 


‘ --4- 

-;~®-V 

(a) 


Clock C is at rest 
in JT 


Clock B is at 
rest on the earth 



The rocket turns around at clock C 
and returns with uniform velocity 
to clock B, which is at rest on the 
earth 




(sy- 


i 

i 

i 

b) 


I 


/ JT’fmoves with velocity 
1 u v —►relative to.a ) 

V *' 

The rocket is at rest in S’' on the 
first half of the journey 


(b) 


y\ 


..removes with velocity 
y-»-relative toZ’ ) 




The rocket is at rest in 2? on 
the second half of the journey 


(C) 

Figure 8.1. Clock A on the rocket goes on a journey into outer space. The rocket goes with 
uniform velocity v until it reaches clock C, which is at rest relative to the earth; the rocket 
then turns around and returns to the earth with uniform velocity v 


According to the Lorentz transformations, for the origin of Z‘ 



(**)bc = (t L ) BC V(1 - *■/*•) = (^)bc(I - M‘ 2 ) (8.1) 

(^z#) bc — ((r)bc — |(r 2 /r 2 ) (*2 ,)bc (8.2) 
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Thus according to the Lorentz transformations clock A on the 
rocket should read less than clock C, which is at rest relative to 
the earth. The time interval (<^)bc measured by the clock on the 
rocket is a proper time interval, whereas the difference between 
the readings of clocks B and C, which are both at rest relative to the 
earth, is not a proper time interval; eqn (8.1) is the expression 
for time dilation. 

We shall now consider what may happen if the rocket turns 
around at the posidon of clock C and returns with uniform velocity 
v back to the earth. In order to turn around the rocket must undergo 
an acceleration. It will be assumed that the rocket turns around 
quickly and then returns to the earth with uniform velocity v. On 
the return journey the rocket is at rest in a different inertial frame, 
say £", from the inertial frame £' in which it was at rest on the 
outward half of the journey. The inertial frame £” is moving with 
uniform velocity v along the negative Ox axis relative to £ as shown 
in Figure 8.1(c). Once the acceleration is over, the transformations 
of the theory of special relativity can be applied again. The time 
interval read by clock A on the rocket (^b)cb f° r the time for the 
rocket to go from C to B, after the acceleration is over, is again a 
proper time interval, whereas the difference between the readings 
of clocks G and B when the rocket passes them, namely (t£) CB , is 
an improper time interval and we should have 


( 4 ) 


CB 


(8.3) 


or 


(<t)cB - V(1 -v*W) 

(^r)cb = (*&) cb'V / (1 — v 2 /c 2 ) — (tj^j cbO |w 2 /c 2 ) (8.4) 


What will the readings of clocks B and A be when they coincide at 
the end of the journey ? In his paper of 1905, Einstein 1 wrote: 


From this there ensues the following peculiar consequence. If at the 
points A and B of K there are stationary clocks which, viewed in the 
stationary system, are synchronous; and if the clock at A is moved with 
the velocity v along the line AB to B, then on its arrival at B the two 
clocks no longer synchronize, but the clock moved from A to B lags 
behind the other which has remained at B by ^tipjc 2 (up to magnitudes 
of fourth and higher order), t being the time occupied in the journey 
from A to B. 

It is at once apparent that this result still holds good if the clock moves 
from A to B in any polygonal line, and also when the points A and B 
coincide. 

If we assume that the result proved for a polygonal line is also valid 
for a continuously curved line, we arrive at this result. If one of two 
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synchronous clocks at A is moved in a closed curve with constant velocity 
until it returns to A, the journey lasting t sec, then by the clock which has 
remained at rest the travelled clock on its arrival at A will be sec 

slow. Thence we conclude that a balance clock at the equator must go 
more slowly, by a very small amount, than a precisely similar clock 
situated at one of the poles under otherwise identical conditions. 

Einstein’s suggestion is generally referred to as the clock hypothesis. 
According to Einstein, one can add eqns (8.1) and (8.4) to obtain 
the total time for the journey measured by clock B which is at rest 
on the earth, namely (^bcb — *z> and the time (4) bob = 4 
measured by clock A which is at rest on the rocket. Following 
Einstein, one obtains 

(*k)bcb = (fz^BCB^U — v 2 /c z ) ^ (^xJbcb^I — £ (8-5) 

or 

4 = t L V (1 - f 2 /c 2 ) — <i(l — \ ^ (8.6) 

Thus, if Einstein’s suggestion were correct, the clock on the rocket 
should read less at the end of the journey than the clock which 
remained on the earth; this is one example of the clock paradox. 

In the above discussion the Lorentz transformations were used 
directly. The calculation is sometimes carried out using the ideas 
developed in Section 6.2. For the benefit of those readers who have 
familiarized themselves with Section 6.2, a brief account of this 
approach will now be given. The motion of the rocket relative to 
the earth is represented graphically in Figure 8.2. The displacement 
of the rocket relative to the earth, that is, x, is plotted against 
m = ct L using a rectangular co-ordinate system. On the outward 
half of the journey clock A in the rocket moves with uniform velocity 
v relative to S, and its ‘world line’ is represented by the line OP 
which makes an angle <f> = tan -1 vjc with the m axis and an angle 
( 77/2 — <f >) with the x axis, as shown in Figure 8.2. The rocket 
coincides with clock G at the ‘world point’ P. The world line of 
clock B, which remains at rest on the earth, is represented by the 
m axis in Figure 8.2, and clock B is at the ‘world point’ Q at the 
time t L j2 (relative to the earth), when clock A coincides with 
clock G at the ‘world point’ P. The distance QP, which is equal to 
vt L /2, represents the displacement of the rocket at the time t L / 2 
(relative to the earth). Draw a hyperbola through P parallel to the 
calibration hyperbola corresponding to G = —1. The unit of 
time for the co-ordinate system S’ in which the rocket is at rest on 
the outward half of the journey is proportional to OP whilst the 
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iinit of time appropriate to the laboratory system E is proportional 
to OR. If the length OP represents K units of time on the rocket 
then OR represents K units of time in the laboratory. Since 
u ^°®. re P re sents a longer time in the laboratory system E 
t an OP does in the inertial frame E' in which the rocket is at rest 
on the ou tward half of the journey, so that clock C which is at rest 
m the laboratory frame should read more than clock A on the 



rocket when the rocket is at the world point P. After it has turned 
around, the world line’ of the displacement of the rocket relative 
to the earth, on the return half of the journey when it is at rest in 
an inertial frame E", is represented by the line PS. The unit of 
time m the co-ordinate system E n in which the rocket is at rest on 
the second half of the journey is proportional to OT which is equal 
*° , ‘y, J „ he un it of time in the laboratory system is still proportional 
to OR. Since SQ = OQ > OR, the time interval measured by the 
clock on the rocket for the return half of the journey, after the 
acceleration is over, is less than the time interval measured by 
clocks at rest relative to the earth. According to Einstein (1905) the 
clock on the rocket would read the time represented by OP (in E') 
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plus PS (in E") which is less than the time represented by OS in E, 
in agreement with eqn (8.6). 

So far we have only considered the case in which the rocket 
always moves with uniform velocity except when it turns around. 
Einstein’s theory can be extended to the general case, when the 
rocket undergoes a series of accelerations or undergoes a continuous 
acceleration. In the inertial frame E (the laboratory system) for 
two successive positions of the rocket at (x, y, z, t) and ( x + 6x, 
y + by, z + 8z, t -\-8t) we have 


or 


8x 2 + dy 2 + 8z 2 — c 2 8t 2 = 8s 2 = —c 2 8r 2 

8t = 8tV( 1 - u 2 jc 2 ) 


(8.7) 


In eqns (8.7), 8s is the interval between the two events corresponding 
to the successive positions of the rocket, u is the velocity of the rocket 
relative to the earth, and <5r is the proper time interval between the 
two events, so that 8r is equal to the time interval between the two 
events, measured by a clock at rest in the inertial reference frame 
in which the rocket is instantaneously at rest. According to the 
clock hypothesis, the time difference between the two events, 
measured by the clock inside the rocket, is also equal to 8r. Hence, 
according to the clock hypothesis, the rate of the clock in the rocket 
is independent of the acceleration of the rocket, and is equal to the 
rate of a clock of identical construction at rest in the inertial frame 
in which the rocket is instantaneously at rest. According to the 
quotation taken from Einstein 1 given earlier, if the rocket is accel¬ 
erating, eqn (8.7) can be integrated to give, in the general case 

^ jds = Jdr = J d/V(l - u 2 /c 2 ) (8.8) 


where u is the instantaneous value of the velocity of the rocket 
relative to the earth. The time of the journey in the laboratory 
frame is equal to J df, whilst the time measured by the clock on the 
rocket should be equal to J dr. Eqn (8.8) is the mathematical 
representation of the clock hypothesis. For the special case shown 
in Figure 8.1, when u — v except for the short period when the 
rocket is turning around, one has 


Jdr = V(1 — v 2 jc 2 ) J d< or t' B = V(1 — v 2 jc 2 ) t L (8.9) 

This is in agreement with eqn (8.6). 

Equation (8.8) can be represented geometrically; j - d« is given by 
the length OS in Figure 8.2 using the time scale appropriate to E, 
whilst for a curved ‘world line,’ representing the motion of the 
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rocket in the general case, J dr is evaluated along the curved ‘world 
line, using at each stage the time scale appropriate to the inertial 
reference frame in which the rocket is instantaneously at rest. Both 
j dr and J dy depend on the actual ‘world line’ representing the 
motion of the moving clock. Both integrals are route dependent, 
so that neither dr nor dr is a perfect differential. 

The conclusion that t B is generally not equal to t L is known as the 
clock paradox . Ever since the introduction of the theory of special rel¬ 
ativity, the clock paradox has provoked a lot of discussion, and there 
are still a large number of scientists who disagree with Einstein’s 
conclusion and suggest that if the experiment were actually per¬ 
formed one would find that t' R would be equal to t L . The clock 
paradox is often stated in terms of identical twins, one of whom goes 
on a journey into outer space and then returns to the earth, whilst 
the other twin remains at rest on the earth. Will the traveller be 
younger, that is aged less, than his twin brother who has stayed 
at home? According to eqn (8.5) he should have aged less. The 
argument has been revived recently, since there is a strong prob¬ 
ability that space travel may become feasible in the not too 
distant future, though the effect due to the clock paradox, if it 
exists wiU be small since t L ~t R ^ and it will not be 

possible to accelerate rockets to speeds comparable with the velocity 
of light, since the fuel requirements would be too heavy The 
recent controversy was started, to a large extent, by Dingle 8 and 
McCrea Dingle has maintained stoutly against all opposition 
that if the experiment were performed, there would be no relative 
aging between the identical twins and t B would be equal to t L . 

In order to avoid the emotional impact of expressing the clock 
paradox in terms of human mortality, the clock paradox will be 
discussed in terms of the readings of radioactive clocks, and in order 
to simplify the discussion, the clock paradox will be discussed, 
initially, m terms of the journey illustrated in Figure 8.1, in which 
t e rocket moves with uniform velocity v except when it is turning 
around. It is accepted by both sides in the dispute that under the 
same conditions a clock will read the same when at rest in both the 
rocket and the laboratory, e.g. to quote Bondi 4 : 

If one inertial observer makes himself a clock, driven by a clockspring, 
that ticks twice a second, then any other observer copying exactly this 
design of clock will have a device for measuring his own time in the same 
way, with two ticks a second. This is not a theorem but a definition, 
the definition of seconds for any observer. Instead of a spring driven 
clock any other device registering time could have been used, for example 
the spectral line of an atom, the time a specific acid takes to eat through 
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a specified thickness of steel, the length of a generation of rabbits on a 
rabbit farm, the decay of a radioactive element, the aging of human 
beings, and so forth. 

The clock paradox will be discussed in terms of the rate of decay of 
a radioactive element. If one started with N 0 radioactive atoms 
at rest on the earth and N 0 identical atoms at rest in the rocket at 
the instant the rocket leaves the earth, then when the rocket returns 
to the earth, according to the law of radioactive decay, the number 
of radioactive atoms remaining in the laboratory (denoted iV LAB ) 
should be given by 

^LAB = (8.10) 

where t L is the time interval measured by a clock at rest in the 
laboratory. Similarly, the number of radioactive atoms remaining 
in the rocket when it returns to the earth (denoted by Brocket) 
should be given by 

-Brocket — jV 0 e _i (8-11) 

where t' R is the time interval measured by a clock at rest in the 
rocket. It is assumed that the radioactive substance has the same 
mean lifetime T 0 when the radioactive substance is at rest in both 
the laboratory and in the rocket. If, as Einstein suggests, t R were 
less than t L then one would find experimentally that -iVaocKET 
would be greater than whilst if Dingle’s view proved correct 

one would find that -Brocket would be equal to iV LAB . It should 
be noted that if Einstein’s view proved to be correct, then there 
would be a physical difference between the box of radioactive atoms 
which remained on the earth compared with the box of radioactive 
atoms which went on the journey in the rocket, since a higher 
proportion of the radioactive atoms in the box in the laboratory 
would have decayed than in the box in the rocket. When the rocket 
turns around it must undergo an acceleration and be transferred 
from one inertial reference frame to another, so that the clock 
paradox involves what may or may not happen to a clock which is 
at rest in a co-ordinate system which is sometimes accelerating 
relative to the fixed stars. 

An attempt is now made to try and state, as objectively as possible, 
some of the arguments put forward in defence of their viewpoints by 
the supporters of both sides in the controversy. The main argument 
of those who say that, if the experiment were actually performed, 
that t L would be equal to t R is roughly as follows. According to 
the theory of relativity two co-ordinate systems in uniform relative 
motion are equivalent, so that clock A in the rocket may be regarded 
as being at rest with clock B on the earth moving relative to it. 
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If the acceleration of the rocket has no effect on the clock in the 
rocket, then the motion of clock B on the earth relative to clock A 
on the rocket should be precisely the same as the motion of the clock 
A on the rocket relative to the earth considered previously, except 
that the velocity of the ‘moving’ clock is in the opposite direction in 
the two cases. If it is assumed that clock B on the earth is moving 
relative to the rocket, then clock B on the earth measures the 
proper time intervals between the events corresponding to successive 
positions of the earth relative to the rocket, whereas the clocks 
‘stationary’ in the co-ordinate system in which the rocket is at rest 
measure improper time intervals. If the arguments used previously 
are applied, exactly as before, one should now conclude that 

t L = V(1 - »•/<*)** (8.12) 

In this case t L should be less than t' R whereas previously it was 
concluded that t L should be greater than t' R . Both these conclusions 
cannot be correct. Dingle and his supporters suggest that, provided 
the effects associated with the acceleration of the rocket can be 
neglected, the way out of the paradox is to conclude that if the 
experiment were performed one would find that t L would actually 
be equal to tR, so that there would be no relative aging between 
the two identical twins. This is also a hypothesis about the behaviour 
of an accelerating clock. 

Those who uphold the alternative view that, if the experiment 
were performed, t' R would be found to be less than t L , start by 
pointing out that the clock in the rocket undergoes an acceleration, 
whereas the clock at rest on the earth does not. (The effects due 
to the rotation of the earth are being neglected.) When one is in a 
train or in a car one knows when one undergoes sudden accelera¬ 
tions or de-accelerations. The experiences of somebody undergoing 
accelerations are different from the experiences of somebody who 
does not undergo accelerations. The ‘experiences’ of the clock on 
the rocket and the clock on the earth are not the same but are 
different, since one of the clocks, namely the clock on the rocket, 
undergoes an acceleration. It is suggested that the clock on the 
accelerating rocket may be changed physically during or as a result of 
the acceleration. It is suggested that, due to the different experiences 
of the clocks, it would not be unreasonable to find that the readings 
of the radioactive clocks on the earth and on the rocket, measured 
in terms of the fraction of undecayed radioactive atoms, differed 
when the rocket returned to the earth. For the reasons to be 
elaborated in Sections 8.2, 8.3 and 8.4, they claim that the clock 
hypothesis, that is eqn (8.8), is correct. 
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To illustrate the role of the Lorentz transformations, we shall 
review how the variation of mass with velocity was developed in 
Chapter 5. In themselves, the Lorentz transformations do not say 
anything about the mass of a moving body. If a body is moving, 
then its positions at various times relative to any inertial frame S 
can be determined using rulers and synchronized clocks at rest 
relative to X. Similarly, the successive positions of the body relative 
to any other inertial frame S', moving with uniform velocity 
relative to S, can be determined using rulers and clocks at rest 
relative to S'. It is the co-ordinates and times of the event of the 
measurement of the position of the moving body at some identifiable 
point on its path, determined by clocks and rulers at rest in S and 
S' respectively, which are related by the Lorentz transformations. 
By themselves, the Lorentz transformations say nothing about the 
mass of the body. Velocity is a kinematic quantity which can be 
determined from the readings of the rulers and clocks at rest 
relative to S and S' respectively. Momentum was defined as the 
product of mass and velocity, the expression for the variation of 
mass with velocity being unknown at this stage. It was then shown 
that, if the law of conservation of momentum was to hold in all 
inertial reference frames, when co-ordinates, times and velocities 
were transformed using the Lorentz transformations, then mass had 
to be redefined such that the mass of a particle of rest mass m 0 
moving with velocity u was m 0 /{ 1 — u 2 /c 2 )L The theory of special 
relativity was used as a heuristic aid to help to select the appropriate 
law for the variation of mass with velocity. This law was then 
compared with experiment. 

Similarly, if there is a clock in an accelerating rocket, the successive 
positions of the rocket relative to any inertial frame S can be 
determined using rulers and synchronized clocks at rest in S. 
Similarly, the successive positions of the rocket relative to any other 
inertial frame S', (which is and continues to move with uniform 
velocity relative to S) can be measured using rulers and syn¬ 
chronized clocks at rest, relative to S'. The rocket is accelerating 
relative to both S and S'. Consider any identifiable point on the 
path of the accelerating rocket. It is only the co-ordinates and time 
of such an event, measured by rulers and clocks which are, and 
remain, at rest relative to S and S' respectively, which are related 
by the Lorentz transformations. Without making some further 
assumption such as the clock hypothesis or the Dingle hypothesis, 
one cannot deduce what the readings of the clock inside the 
accelerating rocket should be, since the clock in the rocket is 
accelerating, from one inertial frame to another. Our approach in 
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Section 8.2 will be to consider a gedanken experimenU and use the 
postulates of the theory of special relativity as a heuristic aid to 
sdect the appropriate law describing the behaviour of an accelerated 
clock. Then in Sections 8.3 and 8.4 these predictions will be 
compared with what experimental evidence is available The 
reader should approach the rest of this Chapter with an open 
mind, without any preconceived intuitive misconceptions, and base 
is final opinion on the theoretical and experimental evidence 
presented to him. 


8.2. A GEDANKEN EXPERIMENTS ON THE CLOCK 
PARADOX USING THE DOPPLER EFFECT 
The example to be discussed was given by Darwin 6 and expanded 
by Bondi, Scott, Builder 7 and others. It is again assumed that a 
rocket goes on a journey from the earth into outer space with uniform 
velocity v, that the rocket then turns around quickly and comes back 
to the earth with uniform velocity v, as illustrated in Figure 8.1. 
Let the time of the journey be measured to be t L in the inertial 
reference frame in which the earth is at rest and let it be measured 
to be t R in the co-ordinate system in which the rocket is at rest 
I he rocket turns around at times t L j 2 and 4/2 relative to the earth 
and rocket respectively. The X-calculus methods of Appendix 6 
are used. 

Let signals be sent out from the earth at intervals of l/v„, where 
ro is the frequency of the signals measured relative to the earth. 
(One can imagine each successive maximum of a monochromatic 
fight wave, emitted by a source of light at rest on the earth as a 
signal.) Let the frequency of these signals be measured to be 
v v 0 f A relative to the rocket, when they reach the rocket when it 

is moving away from the earth towards clock G (which is at rest 
relative to the earth), as shown in Figure 8.1(b). The source of light 
on the earth is moving away from the observer, who is on the rocket 
in t is instance. Let the rocket re-emit signals without time delay 
every time it receives a signal from the earth. These secondary 
signals have a frequency v relative to their source (the rocket). If 
the velocity of light is independent of the velocity of the source of 
light, the secondary signals from the rocket must be recorded at a 
frequency v» = Kv' by an observer at clock C, at rest relative to the 
earth, just as if there had been no intermediate stage. In this case 
t e source of fight on the rocket is approaching the observer near 
clock C. This shows that if the frequency of the light received from 
a source moving away from an observer changes by a factor 1 IK 
t en, if the velocity of light is the same in all directions, the 
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frequency of light from a source approaching an observer must 
change by a factor K. Thus, if a rocket emits signals of frequency 
v' (relative to the rocket) and if the frequency of these signals 
measured by an observer on the earth is v’/K when the rocket 
is moving directly away from the earth, then the frequency of the 
signals received on the earth emitted from the rocket when the 
rocket is approaching the earth with the same speed should be Kv'. 
This is an example of the Doppler effect. It will be shown that the 
value of K agrees with eqn (4.36). (Cf. also Appendix 6 >j.) 

Consider the motion of the rocket relative to the earth for the 
journey shown in Figure 8.1. It will be assumed that light signals are 
sent out from the rocket at intervals 1 /v’, where v' is the frequency of 
the signals measured relative to the rocket. (One can imagine each 
successive maximum of a monochromatic light wave, emitted by a 
source of light at rest on the rocket as a signal.) These signals will 
travel with a velocity c in empty space and reach the earth at a 
later time. Let the frequency of light received on the earth, and 
emitted when the rocket is moving away from the earth be, v'jK. 
When the rocket turns around at a time t L j2 relative to the earth, 
and at a time t' R /2 relative to the rocket, it will be at a distance 
vt L /2 from the earth, measured in the co-ordinate system in which 
the earth is at rest. The light emitted by the rocket when it is 
approaching the earth will have a frequency Kv' relative to the 
earth. It will take a time vt L j2c after the rocket turns around, 
before the light of the new higher frequency reaches the earth 
from the rocket. The change in frequency will be observed on the 
earth at a time t L J2 + vt L /2c (measured by a clock at rest on 
the earth). Since the rocket travels for a time t' B l2 (relative to the 
rocket) before turning around and the light signals are emitted from, 
the rocket with a frequency v' (relative to the rocket) during this 
time interval, the number of signals leaving the rocket between the 
instant it leaves the earth and the instant it turns around is equal to 
v't' R l2. The time interval between the arrival of successive signals 
on the earth (measured in the co-ordinate system in which the earth 
is at rest) is equal to K/v’. Thus between the time the rocket leaves 
the earth and the time tJ2 + vt L j2c when the signals of the higher 
frequency start reaching the earth, the observer on the earth will 
receive v't' R j2 signals, spaced a time interval Kjv' sec apart, so that 



(8.13) 


The observer at rest on the earth will receive signals at a frequency 
Kv' from the time (t L /2)(l + vjc) until the time t L when the rocket 
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returns to the earth, that is, for a time (£ X /2)(I — v[c). These 
signals will be emitted by the rocket between the instant of turning 
around and the instant when the rocket returns to the earth. 
During this time interval (which is 4/2 relative to the rocket) the 
source on the rocket will emit v'4/2 signals. Since the time interval 
between the time of arrival of successive signals on the earth is now 
equal to l/Kv' (measured relative to the earth) one has 


r'4 

2 \ ~c) ~ 2Kv' 


or 



(8.14) 


Multiplying eqns (8.13) and (8.14), and taking the square root 
of both sides one obtains 


4 = t L V(l - v */r 2 ) (8.15) 


This is in agreement with eqn (8.6). Dividing eqn (8.13) by 
eqn (8.14) one finds that K is equal to [(c + y)/( c — a )]i i n agree¬ 
ment with the relativistic theory of the Doppler effect. 

It will now be assumed that light signals are also sent out from 
the earth to the rocket at a frequency v 0 (measured in the co- 
ordinate system in which the earth is at rest). When the rocket is 
moving directly away from the earth, according to the theory of 
special relativity the frequency of the light signals received by the 
rocket is equal to r 0 [(c — v)/(c + o)]i = vJK. This is the frequency 
of the light signals received by the observer on the rocket until the 
instant the rocket turns around. Once the rocket has turned around 
the frequency goes up to v 0 [(e + v)/(c ~ a)]* = Kv 0 . Thus the 
observer on the rocket receives signals of frequency v 0 fK until the 
instant the rocket turns around at a time tj2 (relative to the earth) 
at a distance vt L j2 away from the earth (the distance being measured 
m the co-ordinate system in which the earth is at rest). The light 
reaching the rocket at the instant when it turns around will have 
left the earth at the time t L f2 — vt L /2c (measured relative to the 
earth). The number of signals emitted from the earth in a time 
(4/2) (1 — v/c) (measured relative to the earth) is equal to 


(Vi/2)(1 - vjc) 


This is the number of signals received by the rocket before tur ning 
around at a time 4/2 (relative to the rocket). Since the time 
interval between the receipt of each signal on the rocket on the 
outward half of the journey is equal to Kjv 0 , one has 


4 

2 




or 



(8.16) 


8 
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All the light signals emitted from the earth after the time 

(*z./2)(l - v/c) 

up until the time t L , that is during a time interval (t r J2) (1 + v/c ), are 
received on the rocket at a frequency r 0 [(r + v)/(c — »)]* = Kv 0 , 
such that the time interval between each signal is 1 /Kv 0 (relative 
to the rocket). Since the number of signals reaching the rocket 
on the return half of the journey is equal to v 0 (t L / 2)(1 + v/c). 



Multiplying eqns (8.16) and (8.17) and taking the square root 
of both sides, one obtains 

4 = t L V(l - v*/c*) (8.18) 

This is precisely the same as before [cf. eqn (8.15)]. Thus according 
to the Gedanken Experimente both the observer on the earth and 
the observer on the rocket would agree that t R was less than t L . 
One can see precisely how the experiences of the observers on the 
earth and the rocket differ as far as the receipt of light signals are 
concerned. The observer on the rocket observes a change in 
the frequency of the light coming from the earth at the instant the 
rocket turns around, that is, half way through the journey. On the 
other hand, the observer on the earth does not observe a change in 
the frequency of the light coming from the rocket until after half 
time, but has to wait until light emitted from the rocket, after it 
turns around, has time to reach him before he measures a change of 
frequency. It is generally accepted that once the photons con¬ 
stituting light leave the source of light, they travel with the velocity 
of light and their frequencies and energies relative to any inertial 
reference frame are unaffected by any subsequent changes in the 
motion of the source of light. It is also generally accepted that, if 
an observer changes his state of motion, he immediately observes 
a change in the energy and frequency of a photon (cf. Section 5.8.3). 
If one makes these assumptions, then it is difficult to find flaws in 
the above interpretation of the Gedanken Experimente. The conclusion 
that if the experiment were performed, one would find 

*jR = (1 — V 2 /C 2 ) 

is certainly consistent with relativistic principles. 

In order to obtain the answer t' R = t L in our Gedanken Experi¬ 
mente one could use the classical ether formulae for the Doppler 

246 


Putuc. Tfefit/tcMtatiea/ ~PpLy.A-Lc.4- 


EXPERIMENTAL CHECK OF THE CLOCK PARADOX 

effect, but it was shown in Section 3.10 that these formulae were 
not consistent with the experiment of Ives and Stilwell (1938). If 
the relativistic formulae for the Doppler effect are accepted, one 
would have to make some other assumptions in order to obtain 
the answer t' R = t L . For example, Dingle 8 has suggested that the 
frequency of the light reaching the earth from the rocket might 
change at the instant the rocket turns around. However, such a 
change in the frequency of the light would imply a change in the 
energy of the photons, and this would imply instantaneous action 
at a distance, which is not consistent with the theory of special 
relativity, since, according to relativistic mechanics, energy changes 
cannot be transmitted with a velocity exceeding the velocity of 
light. 

The above interpretation of the gedanken experiment strongly 
favours the clock hypothesis, eqn (8.8). We shall now proceed to 
consider possible experimental checks of the clock hypothesis. 


8.3. A POSSIBLE EXPERIMENTAL CHECK OF THE 
CLOCK PARADOX 

It was pointed out in Section 8.1 that if one started with N 0 
radioactive atoms at rest on the earth and N 0 radioactive atoms at 
rest in the rocket at the instant the rocket leaves the earth on the 
journey illustrated in Figure 8.1, then according to the law of 
radioactive decay 

N lab = N 0 e-U T ° (8.19) 

■Brocket = A 0 e~ ( V 7 " 0 (8.20) 

where jV lab and Arqcket are the number of undecayed radioactive 
atoms left on the earth and on the rocket respectively at the instant 
the rocket returns to the earth; t L and t' R are the times for the 
complete journey measured by clocks on the earth and the rocket 
respectively, and T 0 is the mean life of the radioactive atoms 
measured in the co-ordinate system in which the radioactive atoms 
are at rest. If t' R were less than then one should find experi¬ 
mentally that Brocket would be greater than jV IjAIi . A possible 
radioactive clock would be a box of 7r-mesons which should decay 
with a mean life of 2-55 X 10 -8 sec, when they are at rest in both the 
rocket and in the laboratory. Instead of putting the 7r-mesons in a 
rocket, it is suggested that the 7r-mesons should be sent around a 
closed path (in vacuo) in the laboratory. Fairly monoenergetic 
beams of 7r-mesons of kinetic energy ~100 MeV or more are now 
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available. The possibility of using accelerator produced 77-mesons 
to check the clock paradox was suggested by Martinelli and 
Panofsky 9 , Cochran 10 , and others. 

The experiment suggested would not be exactly the same as the 
rocket experiment, since IVlab an< i -Brocket would not be measured 
at the same time. The proposed experiment will, however, be 
interpreted by comparing it with the rocket experiment, and it is 
hoped that it will be accepted as being equivalent to the rocket 
experiment. 

The experimental arrangement suggested is shown in Figure 
8.3. It is suggested that a collimated beam of 77-mesons be 
passed through the scintillation counter and after deflection in a 


TT-mesons 



C 2 


Deflecting 

magnetic 

field 


Figure 8.3. A possible experimental arrangement to test the clock paradox. 


magnetic field passed through a second scintillation counter C 2 
placed close to Cj and coincidences between C x and C 2 detected. If 
necessary the 77 -mesons could be sent back through the same crystal 
C v If the intensity of the 77-meson beam were low enough, the 
77-mesons, would generally come one at a time and the number of 
chance coincidences would be small. The /i-mesons arising from 
77- pi decays would not generally be emitted in the same direction 
as the parent 77-mesons so that 77-mesons, which would undergo 
77-/1 decay before reaching C 2 , would not give rise to coincidences 
between counters C x and C 2 , except when the pi- meson happened to 
be emitted within the solid angle subtended by C 2 . (A correction 
could be applied for these stray coincidences.) The number of 
77-mesons which started the journey would be given by the number 
of single counts of counter C 1 (denoted by N 0 ), whilst the number 
of 7 r-mesons which completed the journey (denoted by iV R0CKET ) 
would be given by the number of coincidences between Cj and C 2 , 
(after correcting for any focusing losses, stray coincidences, etc.). 
Instead of starting with N 0 77-mesons moving along the path shown 
in Figure 8.3 and observing how many remained when they 
reached C 2 , which would be directly equivalent to sending them in a 
rocket, it is suggested that N 0 successive 7 r-mesons be observed 
going one at a time around the path shown in Figure 8.3, and 
the number reaching C 2 observed. If ‘monoenergetic’ 77-mesons 
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were used, then N 0 successive experiments using one 77-meson at a 
time should be equivalent, when added together, to starting with 
N 0 77 -mesons and doing the experiment once only. This experiment 
is equivalent to sending N 0 77 -mesons on a journey in a rocket and 
counting how many did not decay before the end of the journey. 
This experiment gives N’ R0CKJS , r . 

The time delay in the laboratory between the light pulses in C, 
and C 2 due to 7r-mesons, which complete the path shown in Figure 
8.3 and pass through both scintillation counters, should be 
measured. Such a time interval could be measured accurately by 
a time to pulse height converter of the Green and Bell 11 (1958) type, 
which could measure time intervals down to 10 _1 ° sec. For a total 
path length of four metres traversed at a velocity of 2 X 10 8 m/sec 
the time interval would be 2 X 10~ 8 sec. This time delay gives 
t LAB , the time of flight of the 7r-mesons (or ‘rocket’) from C x to C 2 
measured relative to the laboratory. Since the mean life of stationary 
charged 77 -mesons has been shown to be 2-55 X 10~ 8 sec, one can 
then use eqn (8.19) to calculate what number JV LAB of an initial 
number JV 0 of 77 -mesons stationary in the laboratory would remain 
after a time interval f LAB , the time of the ‘rocket’ journey from C 1 
to C 2 measured relative to the laboratory. 

The experiment consists of comparing the values of iV R0CKET and 
iV LAB . If Dingle’s view were correct one would expect N LAR to be 
equal to iV ROCKBT , corresponding to equal time intervals on the 
clocks at rest in the rocket and in the laboratory, that is, t L — t' R as 
determined from eqns (8.19) and (8.20). On the other hand, if 
Einstein’s view were correct, iV R0CK:BT should be greater than jV lab 
corresponding to t L > t' R . If t L were equal to yt' R [as predicted by 
eqn (8.6)], where y = (1 — p 2 /c 2 ) -i , v being the velocity of the 
77 -mesons, then t L lt' R should be equal to T73 for 7r-mesons of kinetic 
energy 100 MeV and 4-5 for 7r-mesons of kinetic energy 500 MeV. 
According to Dingle’s view the ratio should be equal to unity in 
both cases. The experiment suggested should be accurate enough 
to decide between the two theories. 

It was described in Section 3.10.4, how experiments such as those 
of Durbin, Loar and Havens 12 confirmed the dilation of lifetimes, 
when 77 -mesons travel in straight lines. According to the clock 
hypothesis, one should measure the same dilation of lifetimes, 
relative to the laboratory, when 77-mesons travel in curved paths, 
for example in a circle in a magnetic field or in the path shown in 
Figure 8.3, even if the 77-mesons do return to their starting point. 
An observer travelling with the 77 -mesons should still, however, 
measure their lifetime at rest to be T 0 = 2-55 x 10~ 8 sec. 
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8.4. EXPERIMENTS ON THE TEMPERATURE 
DEPENDENCE OF THE MOSSBAUER EFFECT 

It was suggested by Sherwin 13 that experiments on the temperature 
dependence of the Mossbauer effect in 57 Fe are a check of the 
clock paradox. The interpretation of the experiment given by 
Sherwin is somewhat similar to the Gedanken Experimente described in 
Section 8.2. In this Gedanken Experimente it was assumed that a 
rocket going on a journey into outer space and back sends light 
signals back to the earth, where the number of signals and the time 
intervals between successive signals are recorded. A short account 
of the Mossbauer effect is given in Appendix 5. Under certain 
conditions, when an excited nucleus forms part of a crystal lattice 
when it decays, the y-ray may be emitted without nuclear recoil. 
Similarly, under certain conditions, when it forms part of a crystal 
lattice, a nucleus may absorb a y-ray of the appropriate energy 
without recoil. Recoilless emission followed by recoilless absorption 
leading to nuclear resonance fluorescence is known as the Mossbauer 
effect. It is illustrated in Appendix 5 how a measurement of the 
Mossbauer effect affords, under certain circumstances, a very 
accurate method of measuring the difference between two fre¬ 
quencies, for example using 57 Fe it is possible to measure a change 
of frequency less than 1 part in 10 12 . If the temperature of the 
source or absorber in the experimental arrangement, discussed in 
Appendix 5 and illustrated in Figure A5.4, is changed, then there 
is a change in the counting rate of the detector, due to the changes 
in the amount of recoilless emission (or absorption) due to the 
change in the temperature and hence the velocity of the source 
(or absorber) atoms. 

The atoms in a crystal are not at rest but are vibrating relative 
to the laboratory, and are therefore moving relative to the labora¬ 
tory when the y-rays are emitted. The mean square velocity of 
vibration depends on the temperature of the crystal. According to 
the theory of the Doppler effect the vibrations of the excited nucleus 
lead to changes in the energy (and frequency) of the photons emitted. 
Now, light is emitted in the form of discrete photons or quanta and 
not as a continuous wave motion. Sherwin suggests that this is a. 
point of detail. For example, he writes: 

The waves representing the photon obey Maxwell’s equations, and are 
propagated in exact accordance with requirements of special relativity 
even though they are detected by the usual discrete quantum events. 
Furthermore, in quantum mechanics, the excitation and de-excitation of 
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states by electromagnetic waves is calculated as if the process were 
continuous, that is, in a manner essentially identical to that used for 
macroscopic resonant structures. It is only in the matter of interpretation 
and observation of the calculated state amplitudes that the characteristic 
discrete quantum effects enter. 

The ‘waves’ representing a single photon can be interpreted in 
the same way as the de Broglie waves for a single particle. The wave 
theory of light can be used to estimate the probability that a 
measurement on one photon will give a particular experimental 
result. Following Sherwin, the emission of a y-ray will be inter¬ 
preted using the classical wave theory, keeping the above quantum 
mechanical interpretation in mind. According to classical theory, 
during the lifetime of the excited state the waves are emitted contin¬ 
uously. When discussing the 14-4 keV level in 67 Fe, Sherwin writes: 

Since the centre frequency v 0 is 3-46 X 10 18 c/sec, and since the mean 
lifetime of the excited state is T4 x 10~ 7 sec, there are about 5 X 10 11 
complete cycles in the (intense part of the) wave train representing the 
photon. 

For a crystal the frequency of vibration of ions is ~10 13 c/sec, so 
that a nucleus vibrates about its mean position many times during 
the lifetime of the excited state in 67 Fe. According to classical wave 
theory the motion of the decaying nucleus at any instant affects the 
frequency of the waves emitted at that particular instant. For 
simplicity it will be assumed that the excited nucleus vibrates 
inside the crystal in one dimension only, namely, along the x axis. 
It will be assumed further that the 67 Fe nucleus moves with uni¬ 
form velocity v relative to the laboratory, turns round and moves 
back with uniform velocity v until it turns around again. Consider 
one complete oscillation as shown in Figure 8.4. Classically, one 
can imagine the source nucleus emitting waves continuously, pre¬ 
cisely as in the Gedanken Experimente discussed in Section 8.2. The 
moving source corresponds to the rocket. The signals are emitted 
at a frequency v 0 relative to the source (rocket). The receiving 
nucleus measures the frequency of the waves representing the pho¬ 
tons emitted by the source nucleus, using its own characteristic 
frequency as a standard of comparison. It is being assumed for 
the present that the receiving (absorbing) nucleus is at rest. If 
the receiving (absorbing) nucleus were at position 2 in Figure 8.4, 
then it would correspond precisely to the Gedanken Experimente 
discussed in Section 8.2 and illustrated in Figure 8.1. However, 
source and absorber do not coincide at the beginning and at 
the end of the journey, but the absorber is displaced a distance x 0 
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from position 2 as shown in Figure 8.4. According to the principle 
of the constancy of the velocity of light, the velocity of the waves 
from the source is independent of the velocity of the source, so that 
the time light takes to travel the extra distance x 0 is always xjc. 
Hence, the time interval between the receipt of the first and last 
signal from the ‘rocket’ on the path shown in Figure 8.4 is the same, 


| .Motion of the 

2 1 t v / excited nucleus 
“*1 which emits a 

| V-ray 


Figure 8.4. A simplified diagram of an experiment on the temperature dependence 
of the Mossbauer effect. It is assumed that the emitting nucleus is moving back 
and forth with uniform velocity v; its path is somewhat similar to the path of the 
rocket shown in Figure 8.1(a) 



Actual position 
of the absorber 


whether the absorber is at position 2 or position 1. According to the 
theory of the Gedanken Experimente developed in Section 8.2: 

t' R = t L V( 1 - v*jc*) (8.21) 

where t' R and t L are the times for the journey, illustrated in Figure 
8.4 measured by clocks at rest on the ‘rocket’ (i.e. source) and on 
the earth respectively. Eqn (8.21) is also consistent with the use 
of eqn (8.8). If, during the lifetime of the excited state, the 
excited nucleus goes back and forth many times on the journey 
shown in Figure 8.4, then, since the journey is merely a series of 
repetitions of the one shown in Figure 8.4, one has 

4 = t L V( 1 - v*lc*) (8.22) 

where now t' R and t L are the lifetime of the excited state measured 
relative to the ‘rocket’ (source) and laboratory (absorber) re¬ 
spectively. Let N 0 be the number of maxima of the wave train 
emitted by the source nucleus. The average frequency of the waves 
relative to the laboratory is given by 


V L 


No 

4 


whereas the average frequency of the waves relative to the ‘rocket’ 
(source nucleus) is given by 


v'r = v 0 


No 

Sr 
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Hence 

V L = v' R V(l - v*lc 2 ) = v 0 V(l - V 2 j C 2 ) 

where v 0 is the mean frequency of the emitted y-ray in the inertial 
frame in which the excited nucleus is at rest. If the excited nucleus 
is vibrating in 3 dimensions with non-uniform velocity, then one 
should use eqn (8.8), namely 


or 


Jdr = Jd<V(I - u 2 /e 2 ) 

Jd4 = jdt L V(l -«*/**) 


where u is the instantaneous value of the velocity of the source 
nucleus relative to the laboratory. 

Let Av, be the shift in the frequency of the emitted y-ray when 
the nucleus is in a lattice at a temperature TgK. We have, 


Av. 

— 3 = (1 - » 2 /c 2 )* 
”o 


-v 2 /2e 2 


For the general case of non-uniform velocity, v 2 would be the mean 
square velocity of the excited nucleus. If M is the mass of the 
nucleus, according to classical kinetic theory, we would have 


Hence, 


= 3(kT s )/2 or v* = Ar£ 


Av, 

v o 


3kT s 

-£ _ _2-4 

2Mc 2 


X 


io- 15 r s 


where T s denotes the temperature of the source. 

So far it has been assumed that the nuclei in the absorber are at 
rest. In practice, they are also vibrating, their mean square velocity 
depending on T A the temperature of the absorber. The shift in the 
resonance frequency of absorption is given by 

Av j 

—- = -2-4 x 10- 15 7\ 

’'o A 

Hence the frequency difference Av between the source resonant 
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frequency and the absorber resonant frequency is given by 

— = -2-4 x 10- 15 Ar (8.23) 

where AT = T s — T A . To quote Sherwin again: 

If the source is at a higher temperature than the absorber, then An 
is negative, that is, the source has a lower frequency than the absorber. 
If resonance absorption is to occur, the absorber must be given a small 
velocity away from the source. By contrast, if the source is at a lower 
temperature than the absorber, then for resonance to occur, it is found 
necessary that the absorber move toward the source. 

According to Pound and Rebka 11 for iron near room tempera¬ 
ture, the numerical coefficient in eqn (8.23) should be about 0-9 
times the classical value, corresponding to a Debye temperature of 
467°. Hence eqn (8.23) should read 

— = -2-21 X 10- 15 AT 

v 0 

Pound and Rebka found experimentally that, 

A y 

— = -(2-09 ± 0-24) x 10- 15 Ar 

v 0 

if both the source and absorber were near room temperature. The 
agreement between the calculated and experimental results is 
satisfactory. Thus the temperature dependence of the Mossbauer 
effect can be accounted for in terms of eqn (8.8), or the Gedanken 
Experimente discussed in Section 8.2. 

Hay, Schiffer, Cranshaw and Egelstaff 16 placed a S7 Co source 
near the centre of a rotating wheel. A thin 67 Fe iron absorber was 
placed around the circumference of the wheel. The transmission 
of the absorber was measured for various angular velocities, using 
a counter at rest just outside the wheel, to detect the 14*4 keV 
y- rays. The transmission of the absorber was found to increase as 
the angular velocity increased, indicating a shift in the characteristic 
frequency of the absorber. Since the line shape for the absorber 
at rest was known experimentally, the magnitude of the frequency 
shift could be estimated, and it was found to agree with the frequency 
shift calculated using eqn (8.8) in the laboratory system. II the 
linear velocity of the absorber is v, then the characteristic frequency 
relative to the laboratory is given by 

v A = v 0 V (1 - w 2 /c 2 ) 
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where in this case the moving absorber is equivalent to the moving 
rocket. The experiment can also be interpreted in terms of the 
theory of general relativity in the rotating system (cf. Burcham 16 , 
and see Section 9.8). 

These results on the Mossbauer effect are extremely interesting 
in that they can be interpreted in terms of eqn (8.8) as described 
above. However, the interpretation of the experiments in this way is 
based on the quantum mechanical statistical interpretation of photon 
‘waves’. One has to assume this interpretation to be correct in order 
to prove that the experimental results are a check of the clock 
paradox. For this reason it is still worth while performing the 
7r-meson experiments described in Section 8.3, since these experi¬ 
ments approximate more closely to the rocket experiment illustrated 
in Figure 8.1. These experiments would also be easier to perform 
than putting clocks in satellites or rockets as has been suggested. 

8.5. DISCUSSION OF THE CLOCK PARADOX 

The clock paradox has been viewed as an extension of the theory of 
special relativity, since it involves what may happen when measuring 
apparatus such as clocks are accelerated relative to the fixed stars. 
The discussion was confined largely to the simple example illustrated 
in Figure 8.1, in which a rocket travels with uniform velocity v 
relative to the earth, turns around and comes back with the same 
uniform velocity v relative to the earth. It was shown in Sections 
8.2 and 8.3 that the view that, if the experiment were performed, 
one would find that t' R would be less than t L is not inconsistent with 
relativistic principles and with the experiments that have already 
been performed with 7r-mesons and ^-mesons. It has been shown 
by Tolman 17 and Moller 18 that the result t' R = (1 — » 2 /e 2 )*<£ is 
consistent also with the theory of general relativity (cf. Section 9.5). 
Both the equation 

j”dr = JV(1 — u 2 /c 2 ) d< (8.8) 

and the theory of general relativity have been applied to calculate 
t R and t L when the motion of the rocket is non-uniform. For a full 
discussion the reader is referred to Bondi 4 , Cochran 19 and Scott 6 . 
It does not always turn out that t' R is always less than t L . For 
example, Cochran writes: 

It is amusing to find, in view of the controversy on the aging of space 
travellers, that in the simplest form of space travel, the traveller ages 
most\ For a clock 2. which is thrown up in a uniform gravitational 
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field, moves in a straight line, and returns under its own weight will record 
a greater time than a resting clock Y with which it is synchronized at 
the point of release- • • 

Let us imagine a rocket launching platform above the earth’s atmos¬ 
phere, yet firmly attached to the earth, and two space travellers sent off 
simultaneously, one in orbit around the earth and the other vertically 
upwards. The latter is given a velocity less than the escape velocity, just 
sufficient to allow him to return at the instant when the former has com¬ 
pleted his first orbit. It would be found that the traveller in the satellite 
had aged less than a member of the launching crew, while the other 
traveller has aged more! 

The above example will be discussed in Section 9.6, after dis¬ 
cussing the effects of gravitational fields on the rates of clocks, that 
is the gravitational red shift. 


REFERENCES 

1 Einstein, A. Annin. Phys. 17 (1905) 891 

2 Dingle, H. Nature, bond. 177 (1956) 782; 178 (1956) 680 

3 McCrea, W. H. Nature, Land. 177 (1956) 784; 178 (1956) 681 

4 Bondi, H. Discovery 18 (1957) 505 

5 Darwin, C. G. Nature, Lond. 180 (1957) 976 

6 Scott, G. D. Am. J. Phys. 27 (1959) 580 

7 Builder, G. Aust. J. Phys. 10 (1957) 246 

8 Dingle, H. Philosophy Sci. 27 (1960) 233 

9 Martinelli, E. and Panofsky, W. K. H. Phys. Rev. 77 (1950) 468 

10 Cochran, W. Proc. Camb. phil. Soc. 53 (1957) 646 

11 Green, R. E., and Bell, R. E. Nucl. Instrum. 3 (1958) 127 

12 Durbin, R. P., Loar, H. H. and Havens, W. N. Phys. Rev. 88 (1952) 179 

13 Sherwin, C. W. Phys. Rev. 120 (1960) 17 

14 Pound, R. V. and Rebka, G. A. Phys. Rev. Lett. 4 (1960) 274 

15 Hay, J. J., Scheffer, J. P., Cranshaw, T. E. and Egelstaff, P. A. 
Phys. Rev. Lett. 4 (1960) 165 

16 Burcham, W. E. Sci. Prog., Lond. 48 (1960) 630 

17 Tolman, R. C. Relativity Thermodynamics and Cosmology, p. 194, 1934. 
London; Oxford University Press 

18 Moi.ler, C. The Theory of Relativity, pp. 48 and 258, 1952. London; 
Oxford University Press 

19 Cochran, W. Vistas in Astronomy. Vol. 3. p. 78,1960. London; Pergamon 
Press 

20 Grunbaum, A. Philosophy of Science. Ed. by Danto, A. and Morgenbesser, 
S. p. 399, 1960. Cleveland and New York; Meridian Books 

21 Schwartz, H. M. Am. J. Phys. 30 (1962) 697 

PROBLEMS 

Note: When answering the numerical examples which follow, the reader 
should assume that the clock hypothesis 
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is correct; r is the time interval measured by aclockmovingwithinstantane¬ 
ous velocity u relative to an inertial frame in which the same time interval 
is measured to be f 2 — t v 

Problem 8.1 —Einstein wrote: * • • • the theory becomes a valuable 
heuristic aid in the search for general laws of nature’. 

Discuss what is meant by this statement, which Einstein made about 
the theory of relativity. Give examples of the use of this method. 

Problem 8.2 —Discuss the role the velocity of light plays in the theory of 
special relativity as the limiting velocity for the propagation of signals. 
(References: Griinbaum 20 and Schwartz 21 ). 

Problem 8.2 —Review the arguments, both for and against the view that, 
if one of two identical twins went on a journey into outer space and returned 
to the earth, he would have aged differently from his twin brother who 
stayed at home. 

Problem 8.4 —A rocket accelerates quickly and then moves with uniform 
velocity relative to the solar system, until it reaches a star 8 light years away. 
A clock on the rocket records the time for the journey as 6 years. Calculate 
(a) the speed of the rocket relative to the earth and ( b ) the time of the 
journey relative to the earth, (c) If the rocket turns around quickly and 
returns to the earth with the same velocity as on the outward journey 
compare the total time for the journey relative to the earth and relative to 
the rocket. 

Problem 8.5 —On her 29th birthday a lady physicist concludes that she 
would like to remain 29 for at least 10 years. She decides to go on a journey 
into outer space with uniform velocity. What is the minimum speed she 
must move relative to the laboratory so that she can return 10 years later 
(relative to the laboratory) and still say, quite truthfully, that she is only 29 ? 

Problem 8.6 —An astronaut goes on a 24 hour journey in a satellite, 
which is in a circular orbit around the earth. If the velocity of the satellite 
relative to the earth is 10 4 m/sec, find how much less the astronaut will have 
aged than a person who remains on the ground. You may neglect effects 
associated with the change in gravitational potential at the altitude of the 
satellite. 

Problem 8.7 —A total of 2,000 charged jr-mesons are created at the origin 
of an inertial frame. Half of the 77-mesons remain at rest at the origin, 
whilst the other half go on a journey with a uniform speed of 0-995c. After 
a path length of 15 m, the travelling 77 -mesons are deflected in a magnetic 
field so that they return back to the origin, (a) How many of the moving 
7 r-mcson should survive the journey? (b) How many of the stationary 
77-mesons are left at the origin when the travelling 77-mesons return ? Take 
the mean lifetime of charged 77-mesons to be equal to 2-5 x 10~ 8 sec, when 
they are at rest. 

Compare the above experiment with the travelling twin experiment and 
show that the two are equivalent. 

Problem 8.8 —The most comfortable way of going on a space flight would 
be in a rocket which moved with proper acceleration g, that is, with 
acceleration g relative to the inertial reference frame in which the rocket 
is instantaneously at rest. The astronaut should then feel perfectly at home. 
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This is an example of hyperbolic motion discussed in Section 5.4.1, and in 
Problem 4.10. Relative to the earth, if the rocket starts from rest at t = 
r = 0, one has, as before, u the velocity of the rocket, y = (1 — !i 2 /c 2 ) - ^ and 
x the distance travelled by the rocket relative to the earth given by 


and 





x 




where t is the time of the journey relative to the earth. Assume that t, 
the time on the rocket is given by 


T 


and show that 

t = (c/g) sinh 



Hence, show that 

u = c tanh {grjc ); y — cosh (grjc) 


_c 2 j 

cosh (— 

) -i] 

~ g\ 

L \ c > 



These equations enable one to relate the velocity of the rocket, and the 
distance the rocket has travelled relative to the earth in terms of either 
l or r, and they also enable one to relate t and t. 

If the rocket goes with proper acceleration g for (a) one year, (b) 10 
years, (c) 100 years relative to the earth, how far will it go and what is the 
time relative to the rocket? Take g to be 32 ft/sec 2 . 

Problem 8.9 —A space-ship leaves the earth and moves with constant 
proper acceleration g = 32 ft/sec 2 relative to the inertial frame in which it 
is instantaneously at rest, until the space-ship is half way to a star. It then 
decelerates with proper deceleration g until it reaches the star. It turns 
around and returns with proper acceleration g for the first half of the 
return journey and decelerates with proper deceleration g for the last half 
of the return journey. If the total time for the journey is (a) 40 years and 
( b) 4,000 years (relative to the earth) how far away is the star, and what is the 
time for the complete journey relative to the rocket? (Comment: This 
problem and the previous one are taken from Bondi 4 where a full discussion 
of the problem is given.) 

Problem 8.10 —Give a short account of the Mossbauer effect. Describe 
how the effect depends on the temperatures of the source and the absorber. 
Explain how observations of the temperature dependence of the Mossbauer 
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effect have been interpreted in terms of the clock hypothesis that t = 

Problem 8.11 —Use geometrical methods to show that *R> the time for 
the journey of the rocket represented by the ‘world line’ OPS in Figure 8.2 
(page 237) and measured by a clock on the rocket, is equal to /^Vl — v 2 jc 2 , 
where v is the velocity of the rocket and is the time for the journey 
measured by a clock on the earth. (Hint: t' op — OQ sec <j>/g g and t" pft = 
QS sec <j>jg” t , where tan <f> = v/c and g' t = g" = y Vl + (v 2 /c 2 ).) 
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9.1. INTRODUCTION 

The theory of special relativity is confined to the discussion of 
physical phenomena relative to inertial reference frames. It is not 
applicable in non-inertial frames of reference. It was shown in 
Section 1.5 that it is necessary to introduce ‘fictitious’ inertial forces 
if one wishes to apply Newton’s laws of motion in a co-ordinate 
system accelerating or rotating relative to an inertial frame. It is 
possible to measure one’s rotation relative to the fixed stars from 
experiments carried out in a closed laboratory. Newton believed 
that absolute rotations could be measured. After developing the 
special theory of relativity, Einstein strove for ten years to develop 
a general theory of relativity. 

Einstein felt that it should be possible to express the laws of 
physics in a covariant form for all reference frames, that is, the 
mathematical form of the laws of physics should be the same in all 
frames of reference whether they are accelerating or not. This is 
the principle of covariance. The general mathematical equations 
might reduce to simpler mathematical forms in inertial reference 
frames, and they should be consistent with the theory of special 
relativity in inertial reference frames. 

In order to develop the full theory, Einstein found it necessary 
to use tensor analysis. It would be beyond the scope of the present 
book to discuss the general theory comprehensively. The discussion 
is therefore confined to an account of the principle of equivalence, and 
to a few applications of it. This gives the reader a little insight into 
the general theory, and illustrates some of the limitations of the 
special (or restricted) theory of relativity. The reader should not 
expect a complete elucidation of Einstein’s theory of general 
relativity from the following discussion, which is no more than a 
popular account, designed to help the reader to tackle the more 
comprehensive books by Eddington, Tolman, Mollcr, Fock, etc. 

9.2. THE PRINCIPLE OF EQUIVALENCE 

Let an astronaut be in a space-ship which is at rest in an inertial 
frame Z in a region of outer space where there is no gravitational 
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field, as shown in Figure 9.1(a). There are no windows in the space¬ 
ship so that the astronaut cannot look out, and it is soundproof so 
that he cannot hear anything. Everything inside the space-ship is 
in a state of weightlessness. Now let a signal from the earth fire 
some small rockets so that the space-ship starts moving with uniform 



(a) (b) 

y 


(c) 

Figure 9.1. The spaceship has uniform acceleration a relative to the inertial reference frame 
H. The accelerating reference frame K' is the reference frame in which the spaceship is at 
rest; K' has an acceleration a relative to 2. All bodies not acted upon by any forces accelerate 
downwards in K'. According to the principle of equivalence, measurements carried out in K' 
yield the same results as experiments carried out in the non-accelerating reference frame K 
shown in (c), in which there is a gravitational field, opposite in direction to a and of such 
a strength that the acceleration due to the force of gravity is numerically equal to a 

acceleration a relative to E. Consider what happens from the 
viewpoint of the reference frame K', shown in Figure 9.1(b), in 
which the space-ship is always at rest. The astronaut is no longer 
in a state of weightlessness. If he holds up an apple and lets it go, 
the apple will move with uniform velocity relative to the inertial 
reference frame E, but relative to the space-ship it falls downwards 

261 



~Pufuc. Tfeat/icHKittcnZ ~Phy.A.lc.A. 


THEORY OF GENERAL RELATIVITY 


with uniform acceleration. Since all free bodies not acted upon by 
any forces move with uniform velocity relative to Z, they all appear 
to fall with the same acceleration relative to the space-ship. If a 
spring balance, with a weight attached to it, is attached to the roof 
of the space-ship, the weight would remain at rest (or move with 
uniform velocity) relative to Z, unless the spring were under 
sufficient tension to give the weight the same acceleration as the 
space-ship has relative to Z. If the acceleration of the space-ship 
were equal to g, the acceleration due to gravity on the surface of 
the earth, the conditions in the space-ship would be similar to the 
conditions in a space-ship at rest on the surface of the earth. How 
should the astronaut interpret the results in the accelerating space¬ 
ship, if he did not know that rockets were accelerating the space¬ 
ship? He would be quite as justified as Newton was in concluding 
from the observations on a falling apple, that he was in a gravitational 
field. The astronaut could interpret the tension in the spring 
balance as an effect due to the weight’s being attracted in a gravita¬ 
tional field. In fact, the astronaut could not tell the difference 
between times when his space-ship was accelerated relative to the 
fixed stars in a gravitational free region, and times when the space¬ 
ship was at rest (or moving with uniform velocity) relative to the 
fixed stars and when massive bodies were passing underneath the 
space-ship giving rise to a gravitational field. 

In 1911, Einstein proposed that a co-ordinate system K', illustrated 
in Figure 9.1(b) accelerating with uniform acceleration relative 
to the fixed stars, and a system K, illustrated in Figure 9.1(c), at 
rest in a homogeneous gravitational field, are exactly equivalent. 
Experiments carried out under the same conditions in K and K' 
should give the same numerical results. This is the principle of 
equivalence. To quote Einstein 1 : 

But we arrive at a very satisfactory interpretation of this law of experience 
if we assume that the systems K and K' are physically exactly equivalen t, 
that is, if we assume that we may just as well regard the system K as 
being in a space free from gravitational fields, if we then regard K as 
uniformly accelerated. This assumption of exact equivalence makes it 
impossible for us to speak of the absolute acceleration of the system of 
reference, just as the usual theory of relativity forbids us to talk of the 
absolute velocity of a system; and it makes the equal falling of all bodies 
in a gravitational field a matter of course. 

Since all bodies not acted upon by any forces should continue 
to move with uniform velocity relative to an inertial reference frame 
whatever their inertial masses, they should all have the same acceler¬ 
ation relative to an accelerated reference frame. Hence, according 

262 


~Pufuc. Tfeat/ieHtuft'ui/ P/tylieA 



THE PRINCIPLE OF EQUIVALENCE 

to the principle of equivalence all bodies should fall with the same 
acceleration in a homogeneous gravitational field. It was shown 
in Section 1.3 that, if the acceleration of bodies towards the centre 
of the earth is the same, then the inertial mass and the gravitational 
mass of a body are proportional to each other. [If M is the gravita¬ 
tional mass of the earth and m g and m 1 are, respectively, the 
gravitational mass and the inertial mass of a body, one has 


. Gm a M 
/= - ~ 2 ~ = 


or a = 


trig IG. 


mi 


which is constant if m„/mj is a constant.] Thus the equality of 
inertial and gravitational mass follows from the principle of equiva¬ 
lence. It was shown by Eotvos that the inertial and gravitational 
masses of a body are equal to within 1 part in 10 9 . For a recent 
discussion the reader is referred to Dicke 2 . It should be remembered 
that in these experiments, the particles had low velocities, so that 
the equality of gravitational mass and inertial mass has not been 
checked at high velocities comparable to the velocity of light. 

In a region where the gravitational field is uniform one can 
‘transform away’ the gravitational field by transforming to a 
co-ordinate accelerating in the direction of the gravitational field 
with an acceleration numerically equal to the acceleration due to 
the gravitational field. The new co-ordinate system is then an in¬ 
ertial reference frame. In general, gravitational fields are not 
uniform (or homogeneous) throughout all space, so that one cannot 
replace the gravitational fields, throughout all space by transforming 
to a single co-ordinate system accelerating relative to the first. For 
example, one cannot ‘transform away’ the gravitational field of the 
earth throughout all space. 

To quote Fock 3 : 

The Principle of Equivalence is related to the fund amental law of equality 
of inertial and gravitational mass, but it is not identical with that law. 
The latter is a law of general, not of local, charact er whereas the equiva¬ 
lence of accelerations and gravitational fields is entirely local, i.e. refers 
to a single point in space (more exactly, to the spatial neighbourhood of 
the points on a time-like world line). 

The equivalence amounts to the following. By introducing a suitable 
system of co-ordinates (which is usually interpreted as an accelerated 
frame of reference) one can so transform the equations of motion of a 
mass point in a gravitational field that in this new system they will have 
the appearance of equations of motion of a free mass point. Thus a 
gravitational field can, so to speak, be replaced, or rather imitated by a 
field of acceleration. Owing to the equality of ine rtial and gravitational 
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mass such a transformation is the same for any value of the mass of the 
particle. But it will succeed in its purpose only in an infinitesimal region 
of space, i.e. it will be strictly local. 


9.3. THE RATES OF CLOCKS IN GRAVITATIONAL 
FIELDS 

The principle of equivalence will now be applied to calculate the 
rates of clocks in a gravitational field. It will be assumed that all 
velocities are very much less than the velocity of light, and we shall 
work to second order of vjc only. 

Consider a rocket of length h moving with acceleration a relative 
to an inertial frame 27, in which there is no gravitational field, as 
shown in Figure 9.2(a). Let the rocket be instantaneously at rest 
in 27 at the time t — 0. Let two clocks of identical construction, 
labelled 1 and 2, be fixed to the rocket, with clock 1 at the rear end 
and clock 2 at the front end of the rocket, as shown in Figure 9.2(a). 
As examples of suitable clocks, two light sources are chosen which 
have a frequency v 0 when the rocket is at rest in an inertial reference 
frame in which there is no gravitational field. Consider the light 
emitted from clock 1 at a time f = 0, when the rocket is instan¬ 
taneously at rest in the inertial reference frame 27. This light reaches 
clock 2 after a time interval t (measured in 27) given by ct == 
(h + £< at 2 ), which reduces to t — hjc if \at < c. After a time f, clock 
2 is moving with a velocity v = at ^ ahjc relative to 27. To an 
observer situated by clock 2 it will appear as if the light reaching 
him from clock 1 was emitted by a source moving downwards with 
velocity v, so that he will observe the light to be Doppler shifted, 
the frequency being, to first order, v' = v 0 (l — vjc) = r 0 (l — ahjc 2 ). 
This is a lower frequency than that of the light from clock 2. 
Light emitted from clock 2 at the instant t = 0 in 27, has a frequency 
v 0 (l + ahjc 2 ) relative to clock 1 when it reaches clock 1. To first 
order v/c the difference in both cases is v 0 ah/c 2 , if ah/c 2 <1. 

According to the principle of equivalence, these experimental 
results should be exactly the same as when the same experiments 
are carried out in the non-accelerating co-ordinate system K , 
shown in Figure 9.2(c), in which the rocket is at rest and in which 
there is a gravitational field of such a strength that g, the acceleration 
due to the gravitational field, is numerically equal, but opposite in 
direction, to a, the acceleration of the rocket relative to 27. The 
light reaching clock 2 from clock 1 should have a frequency lower 
than clock 2 by an amount v 0 gh/c 2 . Conversely, light reaching 
clock 1 from clock 2 should have a frequency higher than the 
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Figure 9.2. (a) In the inertial reference frame 2 the rocket has an 
acceleration a. Light is emitted from two identical light sources 1 and 
2 at the time t — 0, when the rocket is instantaneously at rest in 2. By 
the time light from source 1 reaches source 2, source 2 is moving relative 
to 2, so that the light from source 1 will appear Doppler shifted to an 
observer by source 2. (b) In the accelerating reference frame K', in which 
the rocket is at rest, the light reaching source 2 from source 1 has a 
lower frequency than source 2. According to the principle of equivalence, 
measurements in K' should be the same as in the non-accelerating reference 
frame K, shown in (c), in which there is a gravitational field. Hence clock 
2 in (c) should go at a faster rate than 1 in K, and visible light from light 
source 1 should be shifted to the red end of the spectrum compared with 
the frequency of the exactly similar light source 2 


frequency of clock 1. The light sources are both at rest in K. No 
light vibrations should be lost on the way, so that one must con¬ 
clude that, in the presence of a gravitational field, the actual fre¬ 
quencies of the stationary clocks differ by an amount v^h/c 2 . 
This is an actual physical difference associated with the presence 
of a gravitational field. Now gh is the difference in gravitational 
potential between the clocks. According to normal convention, 
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clock 2 is at the higher gravitational potential in K [cf. Figure 9.2(c) ], 
since work must be done to raise a mass from clock 1 to clock 2 in 
the gravitational field in K. So far, it has been assumed that g is 
constant between the two clocks. For a varying gravitational field, 
gh has to be replaced by A <j), where A <f> is the difference in gravita¬ 
tional potential between the clocks, so that Ar, the difference 
between the frequencies, is v 0 A^/c 2 . Hence, the fractional difference 
in frequency is 

Ar A d> 


If jq is the frequency of clock 1, and v 2 is the frequency of clock 2 
when the clocks are at rest in a gravitational field [e.g. as in Figure 
9.2(c)] provided A</> < f 2 . 


'.-^(l +^r) 

and 

~ir) 


(9.2) 

(9.3) 


Eqn (9.2) should hold for other processes, such as the successive 
ticks of clocks. According to eqn (9.2), clock 2 should register 
more ticks in a given time interval, and so should register a longer 
time interval than clock 1. If Aq and Af 2 are the time intervals 
registered by clocks 1 and 2 respectively, since the time interval 
measured by each clock is proportional to its frequency, from 
eqn (9.2), 



A h — A#2^1 — c 2 j (9-5) 

where A<£ is the difference in gravitational potential between 
clocks 1 and 2. Commenting on eqn (9.4), Born 4 writes: 


The physical content of this formula is this: Given two equally con¬ 
structed, synchronous clocks initially at rest relative to each other; if 
one of them is exposed to a gravitational field for a certain period of 
time, they will no longer be synchronized, but the clock which was in the 
field will be retarded. 


The reader should remember that, in the above quotation, the 
clock in the gravitational field is at a lower gravitational potential 
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than the clock in the field-free region, so that the clock in the field 
corresponds to clock 1 in eqn (9.4), and according to eqn (9.4) 
clock 1 should go at the slower rate. All processes should take 
place faster near clock 2 on the top of the stationary rocket [illus¬ 
trated in Figure 9.2(c)] than at the bottom of the rocket near clock 
1. For example, radioactive atoms should decay faster near clock 2 
than near clock 1. The changes are illustrated in Figure 9.2(c) 
where it is assumed that the gravitational field arises from a massive 
body. At the higher altitude everything goes faster and the fre¬ 
quency of light reaching the lower altitude is higher than the 
frequency of light from an exactly similar light source at the lower 
altitude. For visible light, light coming from the higher altitude 
(or higher gravitational potential) is shifted to the violet end of the 
spectrum. One has a red shift for light going from the lower to the 
higher altitude (i.e. from lower to higher gravitational potential). 


9.4. THE GRAVITATIONAL SHIFT OF SPECTRAL 
LINES 


It was shown in Section 9.3 that it follows from the principle of 
equivalence that the frequency of a light source is influenced by a 
gravitational field. Let a light source of frequency v v near a star 
correspond to clock 1 and let a similar light source on the earth 
have frequency r 2 , and correspond to clock 2. According to eqn 
(9.3), 


v i = 



(9.3) 


The difference in gravitational potential A<^>, between clocks 1 
and 2 is positive, since net work would have to be done to take a 
mass from the more massive star to the earth. Hence v 1 is less than 
v 2 , so that A x > A 2 , that is, visible light coming from a star is shifted 
towards the red end of the spectrum compared with light from an 
exactly similar light source stationary on the earth. If A A is the 
increase in wavelength, 


AA ^ A <f> 


(9.6) 


For light coming from the sun the fractional shift in a spectral line 
is AA/A~2T2 X 10 -6 compared with the wavelength of the 
same spectral line from a stationary terrestrial source. This is a 
measurable effect, but it is equivalent to the ordinary Doppler 
effect for a source moving with a velocity of only 0-63 km/sec, so 
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that the gravitational red shift of light from the sun may be masked 
by other effects. According to Dicke 2 , Brault has measured the red 
shift of the solar sodium Dj line. To quote Dicke 2 , ‘After correction 
for asymmetries, the red shift was found to agree with the expected 
result with an accuracy of 5 per cent (noise limited)’. White dwarf 
stars are more massive than the sun, so that A <f> and hence AA 
should be correspondingly larger. Though not definitive, the 
results obtained with light from white dwarfs are consistent with 
eqn (9.6). 

At one time it was felt that it would be impossible to check the 
gravitational shift of spectral lines by means of terrestrial experi¬ 
ments. For example, the change in frequency between sea level 
and the top of Mount Everest is only about 1 part in 10 12 . However, 
atomic clocks are beginning to approach the required accuracy. 
With the discovery of the Mossbauer effect, it became possible to 
measure extremely small changes in frequency. All that was 
necessary was to mount the source and the absorber vertically 
above each other at sea level. The source and absorber would 
correspond to clocks 1 and 2 in Figure 9.2(c). Any shift in char¬ 
acteristic frequency between source and absorber would change the 
proportion of recoilless absorptions leading to a change in the 
counting rate of the detector (cf. Figure A5.4). In the experiment 
carried out by Pound and Rebka 5 , a 0-4 curie source of 57 Co in 
metallic iron was separated by a vertical height of 22 m from the 
absorber. Pound and Rebka controlled the temperature very 
accurately, since a temperature difference of 1°K between source 
and absorber would produce an effect comparable with the gravi¬ 
tational shift of the 14-4 keV line. (The temperature dependence 
of the Mossbauer effect was discussed in Section 8.4.) The shift 
expected by Pound and Rebka was about one part in 2-4 X 10 -16 . 
They verified the existence of the gravitational shift and showed 
that it was 0-97 ± 0-04 times the predicted value. 

It can be seen that one need not use the full theory of general 
relativity to calculate the gravitational shift of spectral lines. The 
shift can be calculated sufficiently accurately using the principle 
of equivalence only. 

9.5. THE CLOCK PARADOX INTERPRETED IN 

TERMS OF THE PRINCIPLE OF EQUIVALENCE 

The example given in Section 8.1 and illustrated in Figure 8.1 
is now reconsidered. In this example, a rocket leaves the earth 
with uniform velocity v, turns around quickly and returns with 
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uniform velocity v to the earth. If t L is the time of the complete 
journey measured by a clock at rest on the earth, and if t' R is the 
time for the complete journey measured by a clock at rest on the 
rocket, then, if the clock hypothesis [eqn (8.8)] is assumed to be 
correct and is applied in the inertial frame in which the earth is at 
rest, it is found that 

t's = t L V( 1 - o«/c«) = < £ ( 1 - (8.6) 

The motion is now considered from the reference frames in which 
the rocket is instantaneously at rest. For the first part of the journey, 
when the rocket is moving with uniform velocity v relative to the 
earth, the rocket is at rest in an inertial frame, which is denoted by 
K v as shown in Figure 9.3(a). The earth is moving to the left with 
uniform velocity v in K v Let the time interval between the earth’s 
leaving the rocket and the instant the earth starts to turn around 
be t x as measured by clocks at rest relative to the rocket, and t x as 
measured by a clock at rest on the earth. In K v t x is a proper time 
interval so that, according to the theory of special relativity, 


U — 


:il(l +**•/«*) 


(9.7) 


V(l - v*lc 2 ) 

When the rocket is turning around relative to the earth it has 
an acceleration a = 2vjt % relative to the earth, where t s is the time 
(relative to the earth) taken by the rocket to turn around and 
regain a velocity v in the opposite direction. The reference frame 
in which the rocket is now at rest is an accelerating reference 
frame. According to the principle of equivalence, the results in 
this accelerating frame would be exactly the same as if the measure¬ 
ments were carried out in a ‘stationary’ co-ordinate system K 2 in 
which the rocket is at rest, and in which there is a homogeneous 
gravitational field, in which the acceleration due to gravity is 
numerically equal to the acceleration of the rocket relative to the 
inertial frame in which the earth is at rest, that is, g = 2 v/t 2 , as 
illustrated in Figure 9.3(b). In the reference frame K 2 , the earth 
does the equivalent of ‘vertical’ motion under gravity. In K 2 , the 
earth is in a region of higher gravitational potential than the rocket. 
Relative to K 2 , all clocks on the earth should go at a faster rate 
than similar clocks on the rocket. If t 2 is the time for the earth to 
turn around (relative to clocks at rest on the earth) and t’ 2 is the 
same -ftme interval measured relative to the rocket, according to 
eqn (9.5), 


4 - 4^ - fa) 


(9.8) 
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Figure 9.3. An example of the clock paradox. The motion is con¬ 
sidered from the co-ordinate system in which the rocket is always at 
rest. The journey is considered in three parts. In the first part, 
illustrated in (a), the earth moves with uniform velocity away from 
the rocket, which is at rest in the inertial frame Aj during this part 
of the journey. In the second part of the journey, shown in ( b), the 
earth is accelerating relative to the rocket, and relative to the rocket 
the earth appears to be moving in a gravitational field. During the last 
stage of the journey ( c ) the earth is moving with uniform velocity 
towards the rocket, which is at rest in the inertial reference frame K a 


where h = p<iis the distance of the earth from the rocket measured 
relative to the rocket, and g = 2vjt i . Substituting for t[ from eqn 
(9.7), one finds for h. 


h = vt[ — 0^(1 + iv 2 Jc 2 ) 
Substituting for g and h in eqn (9.8), 

2v vt t 


4 = 4 


1 - 


tn C 1 


• (l + ¥ 2 I<?) 
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or, to second order, 

, 2v 2 

4 = < 2 - 7 Th ( 9 - 9 ) 

Note, the second term in eqn (9.9) is proportional to t v Even 
if t 2 is very much less than t v so that t z can be neglected when 
calculating t L (as was done in Section 8.1), the time interval t' 2 
cannot be neglected when the motion is considered relative to the 
reference frame in which the rocket is always at rest. If the accelera¬ 
tion of the rocket relative to the earth is increased, so that t 2 is 
smaller, g is increased and the second term in eqn (9.9) remains 
precisely the same. If the length of the journey is increased, then 
relative to K 2 the earth is in a region of still higher gravitational 
potential when the earth turns around, the difference in potential 
being proportional to vt v [If the rocket accelerates from rest at 
the beginning of the journey, the difference in gravitational poten¬ 
tial between the earth and the rocket is practically zero, and the 
acceleration time is (to second order) the same relative to the earth 
and the rocket and can be made negligible compared with and t[. 
A similar argument can be applied if the rocket comes to rest at the 
end of the journey.] 

Once the rocket has stopped accelerating with respect to the 
earth, and is moving with uniform velocity v again, the rocket is 
at rest in the inertial reference frame K 3 shown in Figure 9.3(c). 
The theory of special relativity is applicable again, so that, if t 3 
and < 3 are the times for the return journey of the earth to the 
rocket measured by a clock on the earth and by clocks at rest 
relative to the rocket respectively, then since relative to K 3 , t 3 is the 
proper time interval, 


4 = 4(1 - */*)-* — 4(1 + W) (9-10) 

Adding eqns (9.7), (9.9) and (9.10), 

4 + 4 + *8 = 4 + 4 + *3 + h[(i l>2 l < ?) — (2i> 2 /c 2 )] 4 - t 3 \v 2 jc 2 

Since relative to the earth t 2 is negligible, then — t 3 = t L j 2, so 
that 

t' R = t L ( 1 -*»»/«*) (9.11) 

Eqn (9.11) is the same (to second order of v 2 /c 2 ) as the result 
obtained previously in Section 8.1 [cf. eqn (8.6)], when the 

271 


"PiLfLc. Tfeat/nmt atl&qZ TVtyAtcA 




THEORY OF GENERAL RELATIVITY 

motion was always related to the inertial frame in which the earth 
was at rest. 

It should be emphasized that we cannot claim to have resolved 
the clock paradox. What has been done is to work out the problem 
in a different co-ordinate system to the one used in Section 8.1 
and the same answer was obtained. This is similar to the case of the 
motion of a body on a rotating roundabout, when one obtains the 
same answer if one uses a rotating co-ordinate system or refers 
everything to the lab.-system (cf. Section 1.5). 


9.6. THE RATES OF CLOCKS IN SATELLITES 

With the advent of earth satellites, interest in the clock paradox 
has been stimulated again recently. When a satellite is in orbit 
above the earth there are two effects to be considered when com¬ 
paring the rates of clocks on the earth and in the satellite. Firstly, 
if one accepts the clock hypothesis [eqn (8.8)], there is an effect 
due to the motion of the satellite relative to the earth. Let v 0 be 
the frequency of a process measured on the earth, and let j> 0 + 
Aiq be the frequency of the same process measured in a satellite 
moving with uniform speed v. From eqn (8.8), 


? B = t L V( 1 - v 2 /c 2 ) 1 - ^jc 2 ) 


Since r 0 is proportional to t L and v 0 + Ajq is proportional to t' R , 
the increase in the frequency of a clock in the satellite is given, to 
second order, by 

(9.12) 

that is, the clock in the satellite goes at a slower rate than a similar 
clock stationary on the earth. If m is the mass of the satellite, M 
the mass of the earth, G the gravitational constant, r 0 the radius 
of the earth, and r the geocentric radius of the satellite’s orbit 
(which is assumed to be circular), one has 


or 


mv 2 mGM 
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In eqn (9.13), g, which is equal to GMjrfj, is the acceleration due 
to gravity at the surface of the earth. Substituting in eqn (9.12), 


Av t 

v o 


1 Po([o) 
2 c2\r) 


(9.14) 


Secondly, the clock in the satellite is at a higher gravitational 
potential than a similar clock on the earth, and consequently 
should go at a faster rate. According to eqn (9.1) the increase in 
frequency Ar 2 of the clock in the satellite due to the difference in 
gravitational potential is given by 


Av 2 A<f> 
v 0 ~ e 2 

But, 

^£-^(1-1) 

Hence, 

4a = 9(,_ i) 

r 0 c 2 \ rj 


(9.15) 


Adding eqns (9.14) and (9.15), one finds for Av, the total increase 
in the frequency of the clock in the satellite, 


Av 

v o 


= # 


3£o 

2 r 


(9.16) 


If- 2 

2 r 


< 1 or r > ^ r 0 , 


that is, if the altitude of the satellite is 


> ~ (=o=3,200 km above sea level), Av is positive, since the effect 

due to the change in gravitational potential predominates, and a 
clock in the satellite reads more (and the astronaut ages more) than 
a similar clock (or an astronaut) at rest on the earth. For r < 3r 0 /2, 
that is, if the altitude of the satellite is less than r 0 /2 = 2 = 3,200 km 
above sea level, then the effect given by eqn (9.14) predominates, 
and the clock on the satellite reads less than a clock on the earth, 
and the astronaut should age less than a person on the earth. 
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We are now in a position to understand the quotation taken from 
Cochran 6 and given in Section 8.5. The clock in the rocket 
projected vertically is in a region of higher gravitational potential 
than a clock at rest on the platform and consequently goes at a 
faster rate. The clock in a low altitude satellite goes slower than 
the clock at rest on the platform as it is moving relative to the clock 
at rest on the platform in a region where the gravitational potential 
is the same as on the platform. 


9.7. THE BENDING OF LIGHT IN A GRAVITATIONAL 
FIELD 


In Newtonian mechanics one could in principle use rigid bodies as 
reference bodies and rulers to measure co-ordinates of events. The 
classical concept of rigid bodies had to be refined in the theory of 
special relativity, but one could still in principle use light signals, 
in vacuo, to define straight lines and measure the times and co¬ 
ordinates of events, since according to the theory of special relativity 
light should travel in straight lines with uniform speed in vacuo 
relative to inertial reference frames. 

Consider a space-ship moving with uniform acceleration relative 
to an inertial reference frame E. If light enters through a small 
hole in the space-ship, it travels in a straight line relative to E. 
Relative to the accelerating space-ship the light must travel in a 
curved path. According to the principle of equivalence, observations 
on the accelerating space-ship should be the same as observations 
carried out by a stationary observer in a homogeneous gravitational 
field, of such a strength that the acceleration due to gravity in the 
gravitational field is equal and opposite to the acceleration of the 
space-ship. Hence, light rays should be curved in a gravitational 
field, the light rays being bent in the direction of the acceleration 
due to gravity. 

As an example, Einstein considered the deflection of light rays 
from a distant star, when the light rays passed close to the periphery 
of the sun. Using the difference in clock periods calculated using 
the principle of equivalence, and given by eqn (9.1), Einstein 
showed that a light ray which passes the sun at a distance R from 
its centre should be deflected through an angle a given by 


2 GM 
a ~ 1*R 


(9.17) 


where G is the gravitational constant and M is the mass of the sun. 
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Substituting for G, M and the value of the radius of the sun for R, 
it follows from eqn (9.17), that a = 0-87" of arc. 

There is an alternative approach based on the equality of inertial 
and gravitational mass. It was shown in Section 5.8.3, that a 
photon of energy hv has associated with it an inertial mass kvjc 2 , 
so that the photon should have associated with it a gravitational 
mass hv/c 2 and should be atlracted towards the sun. If one con¬ 
siders a photon incident at an impact parameter R, a straightfor¬ 
ward application of the theory of collisions shows that the deflection 
of the photon is again given by eqn (9.17) (cf. French 7 ). 

After developing his full theory, Einstein recalculated the deflec¬ 
tion of light passing near the sun using the full theory of general 
relativity, and he found that the deflection of light by the sun 
should be double that given oy eqn (9.17), that is, 1-75" of arc. 
It has been shown by Schiff 8 that the value predicted by the full 
theory of general relativity follows from the principle of equivalence 
if one allows for both the changes in the rates of clocks, and in the 
lengths of rods. 

The light rays from stars which pass close to the sun are generally 
invisible, except during a solar eclipse. Two British expeditions 
investigated the deflection of light by the sun during the total 
eclipse of 29th May, 1919. One group went to the Isle of Principe 
in the Gulf of Guinea, West Africa and the other went to Sobral 
in North Brazil. After correcting for systematic errors the final 
results (reduced to the edge of the sun) obtained at Sobral and 
Principe with their ‘probable accidental errors’, were 
Sobral l"-98 ± 0"-12 
Principe 1"• 61 ± 0"-30 

These were consistent with the value calculated on the basis of the 
full theory of general relativity, which is double that given by eqn 
(9.17). A fuller account of the observations is given by Eddington 9 . 
The results have been confirmed by experiments carried out during 
later eclipses of the sun. 

It has been shown in this section that, in the strict Euclidean 
sense, light does not travel in straight lines in a gravitational field, 
so that light rays cannot be used to mark out a rectangular Cartesian 
co-ordinate system in which Euclidean geometry can be applied. 
It has been shown that the rates of clocks depend on the gravitational 
potential. Lengths of rods also depend on the strength of the 
gravitational field. It is, therefore, not surprising to find that, in 
the full theory of general relativity, the numerical value of the speed 
of light depends on the strength of the gravitational field. Hence 
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the principle of the constancy of the velocity of light must be 
modified in the presence of strong gravitational fields. To quote 
Einstein 10 : 

• • • our result shows that, according to the general theory of relativity, 
the law of the constancy of the velocity of light in vacuo , which constitutes 
one of the two assumptions in the special theory of relativity and to 
which we have already frequently referred, cannot claim any unlimited 
validity. A curvature of rays of light can only take place when the 
velocity of propagation of light varies with position. Now we might think 
that as a consequence of this, the special theory of relativity, and with it 
the whole theory of relativity, would be laid in the dust. But in reality 
this is not the case. We can only conclude that the special theory of 
relativity cannot claim an unlimited domain of validity; its results hold 
only so long as we are able to disregard the influences of gravitational 
fields on the phenomena (e.g. of light). 

Near the earth the gravitational field is so weak that the deviations 
of light rays from straight lines can, for all practical purposes, be 
neglected, though the reader should bear in mind that there are 
limitations to the principle of the constancy of the velocity of light. 

9.8. ROTATING REFERENCE FRAMES 

It was pointed out in Section 1.5 that if one wants to apply Newton’s 
laws of motion in a reference frame rotating with uniform angular 
velocity to relative to an inertial frame, one has to introduce the 
centrifugal force —tmo x (to X r) and the Coriolis force — 2mto x 
In Section 1.5 these forces are called fictitious, as they produce 
no acceleration relative to an inertial frame. It is interesting to 
note that both the centrifugal and Coriolis forces acting on a body 
are proportional to the mass of the body. Newton felt that absolute 
rotations could be measured in the way described in Section 1.5. 
For example, Newton suggested that from the tension in the cord 
joining two globes rotating about their common centre of gravity, 
one can determine the amount of their rotation. Newton took this 
to be evidence in favour of absolute space. This view was criticized 
some twenty years.later by Bishop Berkeley, who wrote: 

If every place is relative, then every motion is relative and as motion 
cannot be understood without a determination of its direction which in 
its turn cannot be understood except in relation to our or some other body. 
Up, down, right, left, all directions and places are based on some relation 
and it is necessary to suppose another body distant from the moving 
one • • • so that motion is relative in its nature, it cannot be understood 
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until the bodies are given in relation to which it exists, or generally 
there cannot be any relation , if there are no terms to be related. 
Therefore, if we suppose that everything is annihilated except one globe, 
it would be impossible to imagine any movement of that globe. 

Let us imagine two globes and that besides them nothing else material 
exists, then the motion in a circle of these globes round their common 
centre cannot be imagined. But suppose the heaven of fixed stars were 
suddenly created and we shall be in a position to imagine the motion of 
the globes by their relative position to the different parts of the heaven. 

Newton’s views were also criticized by Mach in the second half 
of the nineteenth century; e.g. Mach wrote: 

Obviously it does not matter if we think of the earth as turning round on 
its axis, or at rest while the fixed stars revolve round it. Geometrically 
these are exactly the same case of a relative rotation of the earth and the 
fixed stars with respect to one another. But if we think of the earth 
at rest and the fixed stars revolving round it, there is no flattening of the 
earth, no Foucault’s experiment, and so on—at least according to our 
usual conception of the law o f inertia. Now one can solve the difficulty 
in two ways. Either all motion is absolute, or our law of inertia is 
wrongly expressed. I prefer the second way. The law of inertia must be 
so conceived that exactly the same thing results from the second sup¬ 
position as from the first. By this it will be evident that in its expression, 
regard must be paid to the masses of the universe. 

The quotations from Berkeley and Mach are taken from Sciama 11 . 
Mach points out that one cannot conclude from the flattening of 
the earth, due to its rotation about its axis, that absolute space 
exists. All one can conclude: is that the effect is associated with 
the rotation of the earth relative to the rest of the matter in the 
universe. Mach felt that, if the theory were properly expressed, 
it should give the same results when the earth is rotating relative 
to the rest of the matter in the universe, as would be obtained if 
the rest of the matter in the universe were to rotate around a 
stationary earth. If the laws of physics are to be the same in all 
reference frames, as required by the principle of covariance, then 
the centrifugal and Coriolis forces must come naturally out of the 
general theory of relativity. 

The interpretation of phenomena in a rotating reference frame, 
suggested by Mach, is illustrated by a simple example. Consider a 
circular roundabout rotating inside a concentric circular room. 
An observer on the rotating roundabout will tend to continue in 
a straight line relative to the fixed stars, and if he sits on the round¬ 
about he will feel himself tending to fly outwards. If the roundabout 
stopped rotating and the circular wall started rotating in the opposite 
direction, when he looked at the wall the observer would see the 
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same things on the circular wall going in the same direction as 
before, but he would no longer feel forces tending to push him 
outwards. He could carry out experiments to ‘show’ that the 
roundabout was rotating in the first instance but not in the second. 
For example, he could attach a weight by a spring balance to the 
axis of the roundabout. If the weight rested on the roundabout, 
the spring balance would register a force in the first instance but 
not in the second. However, the two systems considered so far are 
not really equivalent. In the first case not only is the roundabout 
rotating relative to the walls of the room, but it is rotating relative 
to the rest of the matter in the universe. To be completely equivalent 
all the other masses in the universe would have to rotate around 
the stationary roundabout in the second case. If they did so, 
would the ‘stationary’ observer then feel a force tending to pull him 
outwards from the centre of the stationary roundabout? The 
generally accepted view, at present, is that he would. The origin 
of such a force is generally attributed to gravitational fields due to 
the motion of the distant masses in the universe. 

According to Newton’s theory of gravitation, the force of gravita¬ 
tional attraction between two masses m 1 and m 2 is equal to G{m 1 m^) /r 2 , 
whether the masses are moving or not. According to the theory of 
special relativity, force is not absolute [cf. eqns (5.74), (5.75) and 
(5.76)]. Hence, if the two masses are moving relative to an observer, 
the force of gravitational attraction between them should be 
measured to be different from the value obtained when the masses 
are at rest relative to the observer. As an analogy, consider two 
electric charges. When the charges are at rest relative to an inertial 
frame, the electrostatic force between them is given by Coulomb’s 
law. If the charges are moving, then there are velocity dependent 
magnetic forces between the charges in addition to electric forces. 
If the electric charges are accelerating there is a contribution to 
the electric and magnetic fields which depends on the acceleration 
of the charges. The acceleration dependent fields are the radiation 
terms, which vary as 1/r. The radiation terms are not symmetrical 
in the direction of the acceleration of a charge and the direction 
opposite to it, except in the limit of zero velocity. Similarly, if 
masses are accelerating relative to the observer it is quite feasible 
that the gravitational forces between two masses depends on the 
acceleration of the masses relative to the reference frame chosen. 
Thus it is feasible that the masses of the universe rotating around 
a stationary roundabout would give rise to gravitational forces 
tending to pull the observer away from the centre of the roundabout. 
In the case of the stationary roundabout, the centrifugal force 
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would be interpreted as a gravitational force arising from the 
rotation of the rest of the masses of the universe around the stationary 
roundabout. It is well known that the centrifugal force due to the 
rotation of the earth and t he gravitational force due to the mass of 
the earth cannot be separated experimentally, and for this reason 
tables of the acceleration due to gravity at various points on the 
earth’s surface often include the contribution of the centrifugal 
force due to the rotation of the earth. (One can, of course, estimate 
the contribution due to the centrifugal force, but one cannot separate 
it experimentally from the gravitational force due to the earth.) 

Because of their great distances from the earth it might appear, 
at first sight, impossible for distant stars to affect terrestrial phe¬ 
nomena. The gravitational field due to a stationary mass is propor¬ 
tional to 1 /r 2 , whilst the contribution to the gravitational field due 
to the acceleration of a mass may very well be proportional to 
1 (r (as is the case in electromagnetism). These fields do not neces¬ 
sarily cancel to zero for points off the axis of the roundabout. The 
number of stars between r and r + dr is proportional to r 2 dr, so 
that for a 1/r law the contribution to the field near the centre of 
the stationary roundabout w ould be proportional to r dr. Hence the 
total force would diverge, if one could integrate from 0 to co. 
This would be true for a 1 /r 2 law also. Of course, the actual total 
gravitational force would depend on the boundary conditions, 
such as the size of the universe, the recession of distant nebulae, etc. 
However, the possibility exists that the distant stars may have some 
effect on terrestrial phenomena. According to the principle of 
covariance one should not be able to say by experiment whether 
the roundabout is rotating relative to the rest of the matter in the 
universe, or whether the rest of the matter in the universe is rotating 
relative to the roundabout; the laws of physics should be the same 
in both cases. 

We shall now consider the rate of a clock which is fixed to a 
roundabout which is rotating with angular velocity m relative to an 
inertial frame 27. Let the clock be at a distance r from the axis of 
rotation of the roundabout. The theory of special relativity can 
be applied in the inertial frame 27. The clock is moving with a 
linear velocity rco relative to 27. If v is the frequency of a clock 
stationary in 27 (or a clock on the axis of rotation of the roundabout) 
and v 0 is the frequency of a clock on the roundabout, using the 
clock hypothesis [eqn (8.8)] and the theory of special relativity, 

„ 0 = vV(l - .!/ 2 /c 2 ) ^ v (1 - ir 2 (9.18) 
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The rates of the clocks will now be considered from the reference 
frame in which the roundabout is at rest, and in which the rest of 
the masses in the universe are rotating. It is fairly generally accepted 
that in this reference frame there should be a gravitational field 
(generally attributed to the rotation of the stars), which gives the 
correct centrifugal and Coriolis forces. To quote Moller 12 : 

There is one effect of this kind, however, which, although small, is of 
theoretical importance since it throws new light on the origin and nature 
of the centrifugal and Coriolis forces appearing in a rotating system of 
co-ordinates S. According to the idea of Einstein underlying the general 
principle of relativity (cf. Section 82), these forces are gravitational 
forces originating from the rotation of the distant celestial masses relative 
to S, and such ‘non-permanent’ gravitational fields should satisfy the 
same general field equations as the permanent fields. The approximate 
solutions (34) for weak fields do not directly allow us to treat the effects 
of the distant celestial masses, but we may expect that a rotating spherical 
shell of uniform mass density will produce effects inside the shell similar 
to the rotation of the distant celestial masses 

For a rotating shell of matter, Thirring, [ Phys. Z. 19, (1918) 33; 22 
(1921) 29)] found the interesting result that the field in the interior 
of the shell as determined by the equations (30), is similar to the field 
in a rotating system of co-ordinates thus leading to gravitational forces 
similar to the usual centrifugal and Coriolis forces. 

For a complete discussion the reader is referred to Moller 12 and 
to Thirring. In order to illustrate the interpretation of phenomena 
on a rotating roundabout, we shall consider a simplified model. 
The centrifugal force acting on a body of mass m resting at a distance 
r from the axis of a rotating roundabout is mrm % . It acts away from 
the axis of rotation. The work done in taking a unit mass at negli¬ 
gible speed from the axis to a radial distance r is — I m> 2 dr= —|r 2 oA 

Jo 

If the centrifugal force is a gravitational force, relative to the 
roundabout, the scalar gravitational potential difference due to the 
centrifugal gravitational field is equal to 

A<£ = -|r 2 « 2 (9.19) 

When calculating the above scalar gravitational potential difference, 
effects associated with the velocity dependent Coriolis forces were 
ignored. This is analogous to neglecting the magnetic force on an 
electric charge moved from one point to another. It will be assumed 
that a clock on the axis of the roundabout (whose linear velocity 
relative to the laboratory reference frame E is zero), is synchronous 
with all the clocks at rest in E. If the difference in gravitational 
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potential between a clock of frequency v on the axis of the round¬ 
about and a clock of frequency v 0 at a radial distance r is given by 
eqn (9.19), then, according to eqn (9.2), the clock on the axis 
should go at the faster rate, since it is at the higher gravitational 
potential. From eqn (9.2) [cf. Figure 9.2{c)~\, 


”o=*(l+^)=r(l-^) (9-20) 


This is in agreement with eqn (9.18). An experiment of this type 
was carried out by Hay, Sehiffer, Cranshaw and Egelstaff 13 using 
the Mossbauer effect. The experiment is outlined in Section 8.4. 
Their results were in agreement with eqns (9.18) and (9.20). 

The agreement between eqns (9.18) and (9.20) shows that 
one gets the same answer whether one uses the theory of special 
relativity in an inertial reference frame, or if one uses a non-inertial 
rotating reference frame, provided one introduces the centrifugal 
gravitational field in the rotating system. To quote Born 14 : 


Thus from Einstein’s point of view Ptolemy and Copernicus are equally 
right. What point of view is chosen is a matter of expediency. For the 
mechanics of the planetary sy stem the view of Copernicus is certainly the 
more convenient. But it is meaningless to call the gravitational fields 
that occur when a different system of reference is chosen ‘fictitious’ 
in contrast with the ‘real’ fieids produced by near masses: it is just as 
meaningless as the question of the ‘real’ length of a rod in the special 
theory of relativity. A gravitational field is neither ‘real’ nor ‘fictitious’ 
in itself. If has no meaning at all independent of the choice of co¬ 
ordinates just as in the case of the length of a rod. Nor are the fields 
distinguished by the fact that some are directly produced by masses 
while others are not; in the one case it particularly is the near masses 
that produce an effect; in the other it is the distant masses of the cosmos. 

In addition to the rates of clocks, the lengths of rods are affected 
by rotation. Let the rotating roundabout, shown in Figure 9.4, 
rotate with angular velocity m relative to E. Let an observer O' be 
at rest on the rotating roundabout. Let the observer O' have a 
measuring rod of proper length / 0 , when it is at rest in an inertial 
reference frame. The observer O' sets out to measure the diameter 
and the circumference of the rotating roundabout. His operations 
are considered from the inertial frame E in which the roundabout 
is rotating. The theory of special relativity can be applied in E. 
When the observer O' places his measuring rod along a diameter 
of the roundabout in Figure 9.4, the length of the measuring rod is 
perpendicular to its direction of motion and the rod should not be 
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Lorentz contracted in the direction of its length relative to E. Let 
O' measure the length of the diameter of the roundabout to be X 
units. When O' measures the circumference of the roundabout by 
placing the measuring rod along the periphery of the roundabout 



Figure 9.4. A circular roundabout rotates relative 
to the inertial reference frame 2. An observer 
on the rotating roundabout sets out to measure the 
diameter and circumference of the rotating round¬ 
about. When his measuring rod is placed along 
the diameter of the roundabout, it is moving per¬ 
pendicular to its length, and its length is not 
Lorentz contracted relative to 2. When placed 
along the circumference of the rotating roundabout, 
the measuring rod is moving parallel to its length 
and should be Lorentz contracted relative to 2 

O's measuring rod is moving in a direction parallel to its length, as 
shown in Figure 9.4, and the measuring rod is Lorentz contracted to 

/ 0 V(1 - » 2 /c 2 ) = / 0 V(1 - r 2 co 2 /c 2 ) = 1 0 (1 ~ |r 2 cu 2 /c 2 ) 

relative to E. Hence, relative to E, O' must measure the circum¬ 
ference to be greater than 77 ^=3-142A) units. Now the measure¬ 
ments of the positions of the ends of O's measuring rod as 0' moves 
along the periphery of the roundabout is a series of events. These 
events are independent of any particular co-ordinate system. Hence 
relative to the reference frame in which the roundabout is at rest, 
the circumference must also be measured to be greater than ttX 
by a factor *—'(1 — r 2 a> 2 / 2 c 2 ) -1 . 

Since, in the reference frame in which the roundabout is at rest, 
the circumference of a circle is not equal to it times its diameter, 
Euclidean geometry is not applicable on the rotating roundabout. 
Since the length of the measuring rod varies with its radial position, 
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the ratio of the circumference to the diameter varies with radial 
position. Furthermore, the rates of clocks fixed to the rotating 
roundabout depend on their positions, so that it is impossible to 
synchronize clocks fixed at various positions on the rotating 
roundabout. In the theory of special relativity we used one co¬ 
ordinate system for the whole of space, and using the principle of 
the constancy of the velocity of light one could, in principle, use 
light signals to synchronize all the clocks at rest in that co-ordinate 
system. This is not possible in a rotating co-ordinate system. One 
must change one’s ideas of space and time in the presence of the 
centrifugal gravitational field. One cannot use one single co-ordi¬ 
nate system for the whole of space, but one must use infinitesimal 
local co-ordinate systems and allow the metric to vary from point to 
point. 

Relative to the stationary roundabout, the distant stars would 
have a velocity rco, and for sufficiently large values or r, the stars 
would be moving relative to O' with linear velocities exceeding 
3 X 10 s m/sec (the terrestrial value of the velocity of light). At 
first sight this appears to be a contradiction of the conclusion 
derived in Section 5.3.2, that the velocities of all material bodies 
must be less than c. However, the restriction u < c = 3 X 10 8 
m/sec is restricted to the theory of special relativity. According to 
the general theory, it is possible to choose local reference frames in 
which, over a limited volume of space, there is no gravitational 
field, and relative to such a reference frame the velocity of light is 
equal to c. However, this is not true when gravitational fields are 
present. In addition to the lengths of rods and the rates of clocks the 
velocity of light is affected by a gravitational field (cf. Section 9.7). 
If gravitational fields are present the velocities of either material 
bodies or of light can assume any numerical value depending on the 
strength of the gravitational field. If one considers the rotating 
roundabout as being at rest, the centrifugal gravitational field 
assumes enormous values at large distances, and it is consistent with 
the theory of general relativity for the velocities of distant bodies to 
exceed 3 X 10 8 m/sec under these conditions. 

9.9. REFERENCE FRAMES ACCELERATING RELATIVE 
TO THE FIXED STARS 

It was pointed out in Section 1.5 that, when one is in a train, one 
can tell when the train is accelerating. If one wants to apply 
Newton’s laws in a reference frame having a linear acceleration 
relative to an inertial frame S, one has to introduce inertial forces. 
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These were assumed to be fictitious in Section 1.5. If the laws of 
physics are to be the same in all references frames, then these 
inertial forces must come naturally out of the full theory of general 
relativity. It was illustrated in Section 9.2 how it would be 
impossible for an astronaut in a space-ship to say, without looking 
at something external to the space-ship, whether the space-ship 
was undergoing a jerky accelerated motion due to its rockets firing 
intermittently, or whether large masses were passing near the 
space-ship giving rise to gravitational fields. Thus one cannot 
separate the effects due to acceleration from the effects due to a 
gravitational field. The reader has probably noticed that, when he 
is in an elevator, the parcels he is carrying are heavier or lighter 
depending on the acceleration of the elevator. 

Relative to an inertial frame the ‘fixed’ stars are at rest or moving 
with uniform velocity. However, relative to a reference frame 
accelerating relative to an inertial frame the stars are accelerating. 
It is quite feasible that accelerating masses give different gravita¬ 
tional forces from the gravitational forces due to the same masses 
when they are moving with uniform velocity (cf. electromagnetic 
case discussed by Rosser 15 , Section 9.5). The conditions in an accel¬ 
erating reference frame are different from the conditions in inertial 
frames, since the stars are accelerating relative to the accelerating 
reference frame. It seems plausible to try to interpret inertial forces 
as gravitational forces due to the accelerations of the stars relative to 
the reference frame chosen. The laws applicable in the accelerating 
reference frame would be the general laws, and could be applied in 
inertial frames also, but in the latter case the laws might take 
simpler forms. [For example, in the electromagnetic case, the fields 
E and B reduce to E = qrj Attest 3 and B = 0, ifu = u = 0 relative 
to an inertial reference frame.] The events observed are inde¬ 
pendent of any particular reference frame, but the actual measures 
of physical quantities depend on the arbitrary standard of rest 
chosen. 

Since the rates of clocks and the lengths of rods are influenced by 
a gravitational field one cannot use one single reference frame with 
synchronized clocks distributed throughout space, in the presence 
of a gravitational field. One can only use infinitesimal local co¬ 
ordinate systems, and must allow the rates of clocks, such as atomic 
clocks, to vary from point to point depending on the gravitational 
field, which in turn depends on the distribution of matter. 

It should be noted that, by suggesting that inertial forces are due 
to the gravitational fields of distant stars, we have gone further 
than the way in which we used the principle of equivalence in 
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Sections 9.2 to 9.7. There it was merely postulated that measure¬ 
ments in an accelerating reference frame would give the same 
numerical results as similar experiments performed in a stationary 
reference frame in which there was a gravitational field of appropriate 
strength. In Sections 9.8 and 9.9 it was suggested that, relative 
to non-inertial reference frames, the inertial forces may be inter¬ 
preted as actual gravitational fields due to the acceleration of the 
distant masses of the universe relative to the reference frame chosen, 
that is, inertial forces are gravitational forces. 

9.10. THE GENERAL THEORY OF RELATIVITY 

It has been illustrated how, in the presence of a gravitational field, 
one cannot use Euclidean geometry. Einstein 163, postulated that 
‘all gaussian co-ordinate systems are essentially equivalent for the 
formulation of the general la ws of nature’. 

According to the principle of covariance, the laws of nature 
should have the same mathematical form in all these gaussian 
co-ordinate systems. In order to develop a covariant theory, 
Einstein had to use general tensor analysis. This goes beyond the 
scope of the present book. A reader interested in the full theory is 
referred to Einstein 18 and to Born 17 for popular accounts of the 
theory. For a more comprehensive popular account the reader is 
referred to Eddington 9 . For a full mathematical account of the 
theory the reader is referred to Eddington 18 and to the books by 
M 0 ller, Tolman and Fock quoted previously. 

Einstein assumed that due to the effects of gravitational fields on 
on standard rods and clocks, the distribution of matter determines 
the local space-time geometry in a particular gaussian co-ordinate 
system. When there is no gravitational field the space-time geo¬ 
metry is the Minkowski space-time geometry of the theory of special 
relativity. According to Einstein the paths of light rays in non- 
Euclidean space-time geometries are geodesics. Using his full 
theory of general relativity Einstein was able to calculate the 
gravitational shift of spectral lines and the deflection of light near 
the sun [which is twice that given by eqn (9.17)]. It has been 
shown that these results can be obtained to second order r 2 /c 2 from 
the principle of equivalence, without using the full theory of general 
relativity. Einstein’s full theory gave Newton’s theory of gravitation 
as a first approximation, but there were small differences between 
the two theories. Einstein predicted that the orbit of the planet 
Mercury is not a ‘static’ ellipse, but the perihelion of the orbit 
should precess by 43” of arc per century. Experimentally, it has 

285 


"Pu.'hjc. Tfaql/m omlttm / ~Ph.y.S.lc.6. 



THEORY OF GENERAL RELATIVITY 


been found that the total precession of the perihelion of the orbit of 
Mercury is about 5,600" of arc per century. All but about 40" of 
arc per century can be accounted for by the perturbations due to the 
gravitational attractions of the other planets. Einstein’s theory 
accounts for the extra 40" of arc per century. 

In the general theory the distribution of matter determines the 
metric of the gaussian co-ordinate system chosen. The equations 
of the theory of the general relativity are expressed in the form 
of differential equations. The solutions of these equations depend on 
the boundary conditions. It is still a matter of some controversy as to 
how much effect distant stars have on terrestrial phenomena. It was 
suggested in Section 9.8 that in rotating reference frames, inertial 
forces can be interpreted as gravitational forces arising from the rota¬ 
tion of distant stars. The best way to determine how the motion of a 
body affects its gravitational field would be to perform laboratory 
experiments. However, the gravitational field is very weak and the 
expected changes in the gravitational field would only be of order 
a 2 /c 2 so that there is no prospect of laboratory measurements to 
settle the question. There is therefore room for considerable specula¬ 
tion. There are some people who go so far as to claim that all the 
inertial properties of matter can be attributed to the gravitational 
attractions of distant stars. For a popular account of this view the 
reader is referred to Sciama 11 . Sciama has been criticized by 
Bridgman 19 . For an alternative view of the theory of gravitation 
the reader is referred to Fock 20 . 

It is not easy to see what it is that happens to clock A, which 
undergoes the journey illustrated in Figure 8.1, which can make 
clock A read less than clock B, which remains at rest in an inertial 
frame (the earth). The difference between the readings of the 
clocks is generally attributed to the accelerations of clock A relative 
to an inertial frame. However, the actual magnitude of the dif¬ 
ference in the readings of the clocks depends on how long, and at 
what uniform speed, clock A travels relative to the earth. It has been 
illustrated in this chapter how the distant masses of the universe may 
affect the inertial properties of terrestrial bodies. According to 
Mach’s Principle the rotation of a body should not be treated in 
isolation from the rest of the matter in the universe. Similarly, one 
should probably not consider the two clocks in complete isolation, 
but one should realize that they have different motions relative to 
the rest of the matter in the universe, even when the rocket is 
moving with uniform velocity relative to the earth. Consequently, 
the clocks may experience different gravitational fields due to the 
rest of the matter in the universe, leading in turn to different clock 
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readings. For a discussion of an example of the clock paradox, 
which takes into account the rest of the matter in the universe, the 
reader is referred to Whitrow 21 . 

According to Sciama 11 , the inertia of matter is due to gravitational 
forces due to the distant matter of the universe. When discussing 
the clock paradox in terms of a travelling twin, Sciama writes 11 : 

Either man may be regarded as stationary; the difference between 
them springs from their different relations to distant matter. The clock 
paradox thus has exactly the same status as Newton’s experiment with 
the rotating bucket of water. Some people have claimed that the two 
men are, in fact, symmetrical, and so should age equally during the 
journey. They have clearly overlooked the relevance of distant matter. 
If there were no such matter the men would indeed be symmetrical, and 
would age equally, but in that case there would be no such thing as 
inertial frames, or inertia, and no rocket would be needed to accelerate 
A. Life would be quite different in such a universe. 

Thus according to Sciama’s speculative view, in the absence of 
the rest of the matter in the universe, that is in a universe consisting 
of only two clocks, it is possible that Dingle’s view might be correct. 
Our primary concern should, however, be with the actual universe 
as it is. A substantial amount of evidence has been presented in the 
last two chapters in favour of the alternative view that, in the actual 
universe we live in, for the case illustrated in Figure 8.1, the 
travelling twin would age less than the twin who remained on the 
earth. As long as controversy persists, the most satisfactory solution 
is to perform as many experiments as possible, even though, in the 
author’s opinion, such experiments would almost certainly prove 
Dingle to be wrong. 
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SUPPLEMENTARY READING 

For popular accounts of the general theory the reader is referred to Relativity. 
The Special and the General Theory by Einstein, Einstein’s Theory of Relativity 
by Born, and Space, Time and Gravitation by Eddington. 

For an English translation of Einstein’s original papers the reader is 
referred to The Principle of Relativity (Dover Publications Inc). For an 
introduction to the mathematical theory, the reader is referred to The 
Einstein Theory of Relativity by Lieber and Lieber. F or mathematical accounts 
of the general theory the reader is referred to The Meaning of Relativity by 
Einstein, The Mathematical Theory of Relativity by Eddington, The Theory 
of Relativity by Moller, The Theory of Relativity by Pauli, The Theory of Space 
Time and Gravitation by Fock, Relativity, Thermodynamics and Cosmology by 
Tolman and An Introduction to the Theory of Relativity by Bergmann. 

Approximate derivations of the gravitational red shift, the deflection of 
light in the gravitational field of a star and the precession of the perihelion 
of Mercury are given in Electromagnetism and Relativity, by E. P. Ney. For 
an account of experimental work the reader is referred to Dicke 2 . 

PROBLEMS 

Problem 9.1 —Describe how the gravitational shift of spectral lines and 
the bending of light in a gravitational field can be interpreted in terms of 
the principle of equivalence. 

Problem 9.2 —Calculate the fractional difference in frequency between 
the frequency of an atomic process on top of the spire of Salisbury Cathedral 
(altitude 404 ft or 124 m) and the frequency of the same process at ground 
level. (Take g to be 9-81 m/sec 2 .) 
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Problem 9.3 —Calculate the difference in the readings of clocks at sea 
level and at the top of Mount Everest respectively after one year (altitude 
29,000 ft or 8842 m). Take the acceleration due to gravity to be a constant 
equal to 9-81 m/sec. 

Problem 9.4 —A satellite is in a circular orbit at an altitude of 4,500 km 
above the earth. Calculate the fractional difference between the frequency 
of an atomic process in the satellite and the frequency of the same atomic 
process on the surface of the earth. The radius of the earth is equal to 
6367 km. 

Problem 9.5— Describe how inertial forces are interpreted in the general 
theory of relativity. 

Problem 9.6 —Two identical atoms are hung over a pulley by a weightless 
thread. Initially, one of the atoms (labelled 1) is at a lower level than the 
other atom (labelled 2). The lower atom (number 1) is in an excited state, 
of excitation energy E 0 . According to the theory of special relativity the 
mass of atom 1 is EJc 2 greater than atom 2 which is in the ground state. 
Consequently the heavier atom (number 1) will move vertically down¬ 
wards pulling atom number 2 upwards. Whilst it is moving downwards, 
atom 1 emits a photon of energy •®o» which is subsequently absorbed by 
atom 2. Atom 2 is now heavier than atom 1 and now atom 2 moves 
vertically downwards pulling atom 1 upwards. Some time after it has 
passed the level of atom I, atom 2 emits a photon of energy E 0 , which is 
subsequently absorbed by atom 1 which then moves vertically downwards. 
If the process is repeated indefinitely work is being done continuously. 
This is therefore an example of a perpetual motion machine. What is the 
fallacy in the above argument? [Hint: Remember the gravitational 
shift of spectral lines. Reference: Frisch 22 .] 
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EXPERIMENTS ON THE MOTION OF THE EARTH 
RELATIVE TO THE ETHER 


(a) Michelson-Morley Experiment 

The most famous of the optical experiments to measure the 
velocity of the earth relative to the hypothetical ether was the 
Michelson-Morley experiment 1 . The experiment will be inter¬ 
preted in terms of the mechanical theories of the ether which were 
in vogue in the nineteenth century. The reader should treat this 
Appendix as an exercise in the History of Science. He should 
attach the same significance to the mechanical ether theories as he 
does to the caloric theory of heat. If he prefers, the reader can 
identify the Absolute System, merely with a system in which the 
speed of light in empty space is the same in all directions. Eqns 
(Al.l), (A1.2) and (A1.3) then follow from the Galilean velocity 
transformations, eqns (1.10), (1.11) and (1.12). 

The physical principles underlying the Michelson-Morley experi¬ 
ment will be illustrated by using the analogy of a swimming race. 
Just as the wind carries sound with it, the movement of the ether 
was supposed to carry light waves with it. Similarly, a river carries 
a swimmer along with it. The flow of the river affects the speed of 
the swimmer, whatever the direction the swimmer is going. Let 
A and B be two points on opposite banks of a river which is flowing 
at a speed v relative to the ground as shown in Figure Al.l. Let l be 
the width of the stream from the diving board A to the other bank. 
Let C be a point which is at a distance l upstream from A. We shall 
consider a race between two swimmers who go from A to B and 
back and from A to C and back respectively. It will be assumed 
that each swimmer is able to swim with the same speed c in still 
water. The question is, which swimmer will win the race ? Obviously 
in still water (v = 0) it would be a dead heat, but how will the 
times of the swimmers compare when there is a current ? 

In order to allow for the current, the swimmer going from A to 
B must aim upstream such that his resultant velocity is in the 
direction A to B. This resultant velocity is equal to (c 2 — » 2 )*, so 
that the time to go from A to B is equal to Z/(c 2 — v 2 ) i . Similarly, 
on the return journey from B to A the swimmer must again aim 
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I - River flows with 

| velocity v 



Figure Al.l. A swimming race in which one swimmer swims from 
A to B and back whilst the other swims from A to C and back. 

According to the classical ether theories this is equivalent to the 
Michelson—Morley experiment 

upstream and his time is again equal to I/(c a — z> 2 )*. The total 
time to go from A to B and back is equal to 

ll == ^ AL1 ) 

The other swimmer will have a velocity c — v on the journey from 
A to C and a velocity c + son the return journey from C to A, so 
that the total time in this case is given by 

1 l 2 Ic _ 21 _ t x 

<2 C — V C + V C 2 — V 2 ~ c( 1 — V 2 jc 2 ) ~ y'l _ V 2J C 2, 

Now v must be less than c otherwise the second swimmer would 
never reach the point C, thus 1/(1 — v 2 ^ 2 )* must be greater than 
unity, so that t 2 is greater than t x by a factor 1/(1 — » 2 /c a )*. If one 
knew c and l and measured the two times t x and t 2 with a stop watch, 
one could estimate v. 

An optical experiment which, according to the ether theories, 
was equivalent to the above swimming race, was carried out by 
Michelson and Morley in 1887. The principle of their interfero¬ 
meter is illustrated in Figure A1.2. Light from a source S is divided 
at a half-silvered mirror A, part of the light going in the direction 
AB and part in the direction AC. The light is reflected by mirrors 
at B and C. The two pencils of light then combine to produce 
interference fringes, which are viewed by the telescope T. For an 
account of the theory of the Michelson interferometer the reader is 
referred to a text book on opdcs. 

For purposes of discussion it will be assumed that there is an 
‘ether wind’ of velocity v in the direction C to A in the laboratory 
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system as shown in Figure A1.2. In vacuo , and for all practical 
purposes in air, there is no dragging of the ether, so that according 
to the ether theories the velocity of light relative to the laboratory 
should be c — v in the direction A to C, c + v in the direction C 
to A, and (c 2 — a 2 )* in the directions A to B and B to A. The 
experiment is therefore equivalent to the swimming race. If 
AB = AC = l then the times for the light to go from A to B and 


B 



Figure A 1.2. Schematic form of the Michelson- 
Morley experiment 


back and A to C and back are given by eqns (Al.l) and (A1.2) 
respectively. 

Hence, 

At = t 2 — fj 

_ 2 J( _ 1 __ i _\ 

- c\(l -v^/c 2 ) VI - v 2 /c 2 l (A1.3) 

= -{(1 - v^c 2 )- 1 - (I - v 2 /c 2 )~i} 
c 

Expanding by the binomial theorem and neglecting terms of order 
greater than v 2 jc 2 , 

9/ i v z 

At = j{1 + v 2 /c 2 - 1 - K/r 2 } = — 

This corresponds to a path difference A 1; given by 

fo 2 

A* = c At — 
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If the interferometer is rotated through 90 degrees such that the 
ether wind is from A to B, then the time to go from A to B and back 
is greater than the time to go from A to C and back by an amount 
lv 2 jc 3 , and the path difference is now given by 



Hence, the total change in path difference on rotating the inter¬ 
ferometer through 90 degrees is equal to 2lv 2 /c 2 . If X is equal to 
the wavelength of the light used, then this change in path difference 
corresponds to a fringe shift of 

21 v 2 
n ~ X c 2 

Michelson and Morley* had a path length of 11 metres, and used 
a wavelength of 5-9 X 10 -7 m. If it is assumed that the ether wind 
is equal to the full velocity of the earth in its orbit around the sun, 
namely 30 km/sec, then v/c =2= 10 -4 , hence 

”“dNrn ^ x < 10 ^ 0 - 37 

This is a fraction of a fringe. To ensure accuracy and freedom from 
vibration, particularly when the interferometer was rotated, 
Michelson and Morley floated the apparatus on mercury and took 
their readings continuously as the whole apparatus rotated slowly. 
The maximum displacement found by Michelson and Morley was 
less than one hundredth of a fringe. They could have detected a 
fringe shift of 0-37 fringe quite easily. Now it might have happened 
that at the time they performed the experiment the earth might 
just have been at rest relative to the ether. However, when they 
performed the experiment at different times of the day and also 
six months later, when the earth was moving in the opposite 
direction relative to the sun, Michelson and Morley did not find 
any fringe shift. There have been many repetitions of the Michelson- 
Morley experiment, but apart from some results by Miller (1925), 
they all found no fringe shift. In 1958 Cedarholm, Bland, Havens 
and Townes 2 carried out an extremely accurate experiment on 
ether drift using two maser beams. Their results showed that the 
ether drift, if it exists, is less than 1/1000 of the earth’s orbital 
velocity. 

How was the null result of the Michelson-Morley experiment 
interpreted at the time it was performed ? Nowadays one tends to 

* Michelson and Morley’s account of their experiment is reprinted in Source 
Book in Physics by W. F. Magie, published by McGraw-Hill. 

293 



I 


PuYe Tfeat/Lcwtatieoi ti 


APPENDIX 1 


lose perspective about the significance of the null result of the 
Michelson-Morley experiment. It was for this reason that the 
swimming race analogy was used. The null result of the Michelson- 
Morley is equivalent to a dead heat in the swimming race. How 
would the reader try to account for a dead heat ? The first suggestion 
might be that the velocity of the river was zero. Another might be 
that the swimmers swam at the same speed despite the flow of the 
river, but this would not be consistent with the Galilean transforma¬ 
tions, if the swimmers were able to swim at the same rate relative to 
still water. One would soon start thinking that the lengths AB and 
AC were not equal, or that the watches used to time the swimmers 
were wrong. Similar interpretations were suggested to account for 
the null result of the Michelson-Morley experiment. One suggestion 
was that v = 0 always, that is, that the reference system in which the 
ether was assumed to be at rest was always at rest relative to the 
earth. However, this would give the earth an omnipotent position 
in the universe which people had been loathe to accept since the 
time of Copernicus. 

Another explanation offered was that the earth pulled the ether 
in its immediate vicinity around with it, so that the ether near the 
earth’s surface was always at rest relative to the earth, but the 
earth moved relative to the rest of the ether in the universe. To test 
the possibility that moving bodies might drag the ether surrounding 
them with them, Lodge carried out an experiment in which an 
interferometer was placed between two large steel discs lm in 
diameter. The steel discs were 2-5 cm apart. The steel discs 
were mounted on a common axle which could be rotated at a very 
high speed. The interferometer remained at rest during the experi¬ 
ment. If the ether between the plates was dragged by the plates 
when they rotated, then it was thought that there might be a dis¬ 
placement of the interference fringes when the rotation started. 
No displacement was observed and it was concluded that the ether 
was not dragged along by the rotating steel discs. 

Another explanation of the Michelson-Morley experiment was 
suggested by FitzGerald and independently by Lorentz. The 
suggestion was that bodies moving relative to the ether might be 
contracted in the direction of motion through the ether such that a 
length l 0 would be reduced to / 0 (1 — v 2 (c 2 )*, where v is the velocity 
of the body relative to the ether. Thus the interferometer arm AC 
shown in Figure A1.2 would be reduced to / 0 (1 — v 2 jc 2 ) i ; this 
would make t x and t 2 [given by eqns ( Al.l ) and (A 1.2) respectively] 
equal to each other. Since rulers would also be contracted in the 
same proportion, the length of AC would still be measured to be l. 
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At the time it was suggested, this was an ad hoc assumption, but 
Lorentz was later able to account for such a contraction in a causal 
way in terms of his electron theory (Rosser 3 , Section 2.4). Lord 
Rayleigh suggested that if a transparent medium did undergo a real 
contraction due to its motion relative to the ether, then it might 
become doubly refracting, just as a block of glass does when it is 
subjected to unilateral stress. Lord Rayleigh 4 attempted to detect 
the effect experimentally but with negative results. Brace 5 carried 
out the experiment with increased accuracy but also found no 
effect, so that motion relative to the hypothetical ether apparently 
produced no state of strain. 

Another possible interpretation of the Michelson-Morley experi¬ 
ment was that the velocity of light was the same in the direction 
AC as in the direction AB. Now according to the ether theories this 
would be equivalent to saying that the swimmers would always 
swim at the same speed relative to the bank so that the swimming 
race would be a dead heat, whatever speed the river was flowing. 
Such an explanation was incompatible with ether theories in which 
the velocity of light had to be related to a transmitting medium 
and the Galilean velocity transformations were used. This is, 
however, precisely one of the assumptions Einstein made in the 
theory of special relativity. According to the principle of the 
constancy of the velocity of light, the speed of light in empty space 
is the same in all directions of space in all inertial reference frames. 
The acceptance of this postulate necessitated the abandonment of 
the Galilean transformations and a revision of the concepts of 
space and time in the way described in Chapter 3. Einstein also 
postulated that the laws of optics and electromagnetism obeyed 
the principle of relativity, in which case, there is no absolute 
reference frame for the description of optical and electrical phe¬ 
nomena. 

Since the advent of quantum theory it has not been necessary to 
interpret light as a continuous wave motion in a continuous 
mechanical medium. It is now believed that light consists of indi¬ 
vidual photons. These photons are treated as one species of funda¬ 
mental particle. The wave properties of light are associated with 
individual photons in a similar statistical way to the way ‘wave’ 
properties are associated with individual electrons. The more 
advanced reader may be interested in trying to interpret the action 
of the Michelson-Morley interferometer from the quantum me¬ 
chanical point of view. (Reference: Frisch 8 ). (Where does the 
principle of the constancy of the velocity of light operate in this 
interpretation ?) 
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(b) Trouton-Noble Experiment 

Maxwell’s equations do not obey the principle of relativity, if 
the co-ordinates and time are changed according to the Galilean 
transformations. Before the advent of the theory of special relativity, 
it was believed that Maxwell’s equations would be valid in one 
reference frame only. If so, it should have been possible to identify 
this absolute system by means of electrical experiments, such as the 
Trouton-Noble experiment 7 . 



Figure A1.3. The Trouton-Noble experiment. Thecharges 
-\-q and —q are moving with velocity v relative to the 
absolute system in which it is assumed that Maxwell’s 
equations must be applied. Relative to the absolute co¬ 
ordinate system the magnetic fields due to the moving 
charges give rise to a couple tending to rotate the appa¬ 
ratus in the anti-clockwise direction. No such couple was 
observed when the experiment was performed 


Consider two equal and opposite electric charges + <7 and —q 
connected by a ‘rigid’ rod of length l which is at rest relative to the 
earth, as shown in Figure A 1.3. It is unlikely that, if the absolute 
system exists, the earth is at rest relative to it. Let the velocity of 
the earth relative to the absolute system be v. If Maxwell’s equations 
must be applied in the absolute system then there is a magnetic 
force between the charges shown in Figure A 1.3. Applying the 
Biot-Savart law one finds that the magnetic force on each charge 
due to the other charge is equal to 


r !H A 2 • 

f ~4rr P 6 
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These magnetic forces give rise to a couple of magnitude 


If ft 0 e 0 = 1 jc 2 then 


G-ZiglnOlone 

4tt r 


1 v 2 q 2 sin 6 cos 6 


4ve 0 c 2 


l 


(A1.4) 


This couple tends to rotate the apparatus in the anti-clockwise 
direction, as shown in Figure A1.3, and the apparatus should rotate. 
The electric force between the charges should be along the line 
joining them and should not contribute to the couple. By measuring 
the rotation it should have been possible to estimate v. An experi¬ 
ment of this type was carried out by Trouton and Noble who 
suspended a charged parallel plate capacitor by a phosphor bronze 
strip, such that the plates ol'the capacitor were in the vertical plane. 
If v were of the order of magnitude of the velocity of the earth in 
its orbit around the sun, then the couple given by eqn (A1.4), which 
is of order v 2 jc 2 , should have been observable in Trouton and 
Noble’s experiment, but the experiment failed to give any indication 
of such a couple, and it proved impossible to determine any absolute 
reference frame for electromagnetic phenomena. This suggests that 
the laws of electromagnetism satisfy the principle of relativity. 

It is simplest to interpret the Trouton-Noble experiment from 
the reference frame in which the two charges shown in Figure A 1.3 
are at rest. According to the principle of relativity, the laws of 
electromagnetism are valid in this reference frame, so that the 
force between the charges should be given by Coulomb’s law for 
the force between electrostatic charges; there should be no magnetic 
force between the charges in the inertial frame in which they are 
both at rest. The resultant force is along the line joining the 
charges and should not give rise to a couple in the reference frame 
in which they are at rest, and the charges should not be observed to 
rotate in any reference fram e. 

Some of the more advanced readers may like to try and interpret 
the Trouton-Noble experiment in the reference frame in which 
the charges are moving with uniform velocity v as shown in Figure 
A1.3. The couple predicted by eqn (A1.4) should still be present. 
How is it balanced out? Reference: Tolman 8 and Panofsky and 
Phillips 9 . 
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MAXWELL’S EQUATIONS AND THE PRINCIPLE OF 
THE CONSTANCY OF THE VELOCITY OF LIGHT 


For the benefit of readers familiar with Maxwell’s equations it will 
now be illustrated how the principle of the constancy of the velocity 
of light follows, if it is assumed that Maxwell’s equations are correct 
and obey the principle of relativity. 

Consider a field point P in empty space at x, y, z and at a time t 
in the inertial frame 27 as shown in Figure A2.1(a). Let the system 
of accelerating charges give rise to an electric and magnetic field at 
P. For a field point in empty space, B = fi 0 H and D = e 0 E so that 
in M.K.S. units, Maxwell’s equations take the form 


, ^ bh 

curl E = 

(A2.1) 

, „ 3E 

curl H = £ 0 

(A2.2) 

div E = div H = 0 

(A2.3) 


where e 0 and /j, 0 are the electric and magnetic space constants in 27. 
Taking the curl of eqn (A2.1) and substituting for curl H from eqn 


y 


2 


• I moves with 
— ^K-«-relative 


veloc i ty 
lo 2 


• P ( A,y,z,U 


• P (x',y,z',t') 




x’ 


(a) (b) 

Figure A 2.1 . {a) The distribution of accelerating charges gives rise to electromagnetic radiation 
which has a velocity (ji 0 e 0 )~i relative to E, at the point P in empty space, (b) The same 
accelerating charges shown relative to the inertial frame S'. The charges give rise to electro¬ 
magnetic radiation which has a velocity relative to S' at a point P' in empty space 
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(A2.2) one obtains 

curl curl E 


3 


-r*Jt 


3 2 E 

curl H = — /x 0 e 0 


Now curl curl E = grad div E — V 2 E. Since div E = 0 in empty 
space, one has 

3 2 E 

V 2 E - gj = 0 ( A2 - 4 ) 

Eqn (A2.4) is a wave equation and is interpreted as describing the 
behaviour of the electric vector of the electromagnetic radiation at 
the point P at a time t due to the accelerating system of charges. 
The velocity of these waves is equal to (jWo e o)~* and this velocity 
is identified with the velocity of light. This velocity is independent 
of the velocity of the charges giving rise to the electromagnetic 
waves, illustrating how according to Maxwell’s equations the 
velocity of light is independent of the velocity of the source of the 
light. 

Now consider the point P' situated at the point x',y’, £ in empty 
space at a time t’ in £' as shown in Figure A2.1(b) . We shall consider 
the electric and magnetic fields at P' due to the same system of 
accelerating charges as was previously considered relative to 27. In 
empty space, for the inertial frame 27', Maxwell’s equations become: 

curl' E' = — ,Mo (A2.5) 

curl' H' = €q (A2.6) 

div' E' = div' H' = 0 (A2.7) 

where e' 0 and /£q are the electric and magnetic space constants 
relative to 27'. The co-ordinates in the expressions for curl' and div' 
in eqns (A2.5)-(A2.7) are £ ,y, £. For example, div' H' is equal to 
(3 H'Jdx') + {dH'Jdy') + ( dH'Jdz '). The fields E' and H' must be 
defined in the same way relative to 27', as E and H are defined 
relative to 27. Taking the curl' of eqn (A2.5), one obtains the 
equation 

V'^E'-^4~ = 0 

The velocity of electromagnetic waves in 27' is therefore equal to 
(Po £ o) _i . This velocity is again independent of the velocity of the 
source of the radiation. 
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PRINCIPLE OF THE CONSTANCY OF THE VELOCITY OF LIGHT 
The ampere is defined as: 

That unvarying current, which, if present in each of two infinitely thin 
parallel conductors of infinite length and one metre apart in empty space, 
causes each conductor to experience a force of exactly 2 x 1 Or 7 newtons 
per metre of length. 

If Maxwell’s equations are valid in £ and £', then the force between 
two parallel currents should be given by the Biot-Savart law in both 
E and E'. If the ampere is defined in the same way in E and £', 
this would imply that f/, 0 = /j,q = 4tt . 10~ 7 by definition. For the 
velocity of light to have the same numerical value in E and E' 
one must also have e 0 = £q. In principle, according to classical 
physics, one could place two protons a known distance, say x 
metres, apart in empty space and measure the force between them 
when they are at rest in the inertial frame E. Similarly one could 
place two protons x metres apart, and measure the force between 
them when they are at rest relative to the inertial frame E'. If 
Maxwell’s equations are valid in both E and £', then Coulomb’s 
law should be applicable in both cases. For e 0 to be equal to e' 0 
it would require that the force between the protons at rest in E 
(measured in E) would have the same numerical value as the force 
between the two protons at rest in £' (measured in £’). Though 
it is not always stated specifically, it is generally assumed in the 
theory of special relativity, that, if two experiments are carried out 
under identical conditions in two inertial frames E and £' which are 
equivalent in every way, then they give the same numerical results, 
within experimental error. If this assumption is made, then e 0 
should equal £q. Since /x Q is equal to [i o by definition, then {fi 0 e 0 )~* 
should be equal to (/ i'qe' 0 )~K According to Maxwell’s theory these 
latter expressions are equal to the velocities of electromagnetic 
waves in free space in E and £’ respectively. Thus, if it is assumed 
that Maxwell’s equations are correct and obey the principle of 
relativity, then the principle of the constancy of the velocity of light 
follows. The simultaneous measurement of the velocity of light in 
E and £' is not carried out under identical conditions in E and £', 
since the velocity of the source of light is not the same relative to 
observers at rest in E and £' respectively, so that a priori there is no 
reason why they should obtain the same numerical value for the 
velocity of light; but according to Maxwell’s equations they should. 
With Gaussian units, for the velocity of light to have the same 
numerical value in E and E', the ratio of the electromagnetic to the 
electrostatic units would have to have the same numerical value in 
E and E. 
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Conversely, if it is assumed that the principle of the constancy 
of the velocity of light is valid, then this implies that s 0 — s'q if 
Maxwell’s equations are correct. Thus, according to the theory of 
special relativity, the force between two charges at rest in either 27 
or 27' should have the same numerical value under the same 
conditions in both 27 and 27'. According to the theory of special 
relativity 27 and 27' are completely equivalent for describing mag¬ 
netic as well as electrostatic phenomena, provided the units are 
chosen in the same way in 27 and 27'. 

The statement that the velocity of light has the same numerical 
value in all inertial frames implies that the fundamental units of 
length and time are defined in the same way in all inertial frames, 
e.g. as described in Section 1.2. 
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DERIVATION OF THE LORENTZ TRANSFORMATIONS 


(a) Using the Principle of the Constancy of the Speed of Light 
The co-ordinates and time of the event of detection of the light at 
the point P relative to E and E' in Figure 3.1 must satisfy the 
relations. 

* 2 + f + z 2 - cV = 0 (A3.la) 

and 

x' 2 +/ 2 + z' 2 - c 2 *' 2 = 0 (A3.1i) 

The required transformations must transform eqn (A3.H) into 
eqn (A3.la). It is illustrated in Section 3.7, using the principle of 
the constancy of the velocity of light, that 

y — y (A3.2) 

z = z' (A3.3) 

It will be assumed that the other transformation equations are 
linear and of the form 

x' --= Ax + Bt (A3A) 

x == Cx' + Dt' (A3.5) 

where A, B, C and D are constants independent of x, x', t and t', 
but the constants A, B, C and D may depend on v, the constant 
relative velocity of E and E'. 

If the transformation equations were not linear, for example, if 

x == Px + Qt 2 

then putting x' = 0 corresponding to the position of the origin of 
E' relative to E, we would have 

x=-(QIP)t 2 

in which case the origin of E' would be accelerating relative to E 
contrary to our assumption that E' and E are moving with uniform 
relative velocity. 

Equations (A3.4) and (A3.5) must describe the motions of the 
origins of E and E' relative to each other. For example, since the 
origin of E and E' coincide at t = t' = 0, the position of the origin 
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of S' relative to S is given by 

x = vt (A3.6) 

Putting x — 0 in eqn (A3.4) corresponding to the position of the 
origin of S', we obtain 

0 = Ax -f- Bt, or x — —(£jA)t (A3.7) 

Comparing eqns (A3.7) and (A3.6), 

v = —BjA or B = — vA 

Hence, eqn (A3.4) can be rewritten 

x = A(x — vt) (A3.8) 

If an observer at rest in S records the velocity of S' relative to S as 
along the +# axis, then an observer at rest in S' should record 
the velocity of the origin of S relative to S' as — v along the x' axis 
of S'. If they did not agree on the magnitude of their relative 
speed, it could only be due to motion in one direction of space 
compared with motion in the opposite direction of space. This 
would contravene the assumption that, relative to any inertial 
reference frame, all directions in space are equivalent. (Reference: 
Rosser 1 , page 90.) Hence the motion of the origin of S relative to 
S', corresponding to x = 0 in eqn (A3.5) must be given by 

x' = -vt' (A3.9) 

Putting x = 0 in eqn (A3.5) 

0 =Cx' + Dt' (A3.10) 


Comparing eqns (A3.10) and (A3.9) we conclude that D — Cv, so 
that eqn (A3.5) can be rewritten 

x = C(x' + vt') (A3.11) 

Substituting for x' from eqn (A3.8) into eqn (A3.11), one obtains 


that is 


x = C(Ax — Avt + vt') 

'-'HO-;*)] 


(A3.12) 


Substitute for x', y', z! and t’ from eqns (A3.8), (A3.2), (A3.3) 
and (A3.12), into eqn (A3. lb), namely 

+ ya + z '2 _ C 2 t 'z = o (A3. lb) 
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After rearranging, 


\ r 


Y2c 2 A 2 ( 

1 \ 1 

?)J 

| + xt 

V » (’ 

~ 3c) - “ 


+ f + z 2 - t 2 [c 2 A 2 - v 2 A 2 ] = 0 (A3.13) 


But x, y, z and t refer to tire co-ordinates and time of the event 
corresponding to the detection of the light at P (cf. Figure 3.1). 
Hence, for the same event, one has, from eqn (A3, la), 


X 2 + Z 2 - c 2 t 2 = 0 (A3.14) 

The coefficients of x 2 , xt and t 2 must be the same in eqns (A3.13) 
and (A3.14). Equating the coefficients of l 2 , 


or 


that is, 


c 2 A 2 - v-'-A 2 = c 2 


A 2 


A = 


1 


(1 - v 2 lc 2 ) 

1 

V(1 - v 2 /c 2 ) 


(A3.15) 


Equating the coefficients oi xt in eqns (A3.13) and (A3.14), 


Since, from eqn (A3.15), A 2 is not equal to zero, 



that is, 

jC- 1 - 

° “ (1 - v 2 jc 2 ) 

Substituting for A from eqn (A3.15), one obtains 

1 

C = =- 

Vl - v 2 lc 2 


(A3.16) 


(A3.17) 


Substituting for A and C from eqns (A3.15) and (A3.17) into eqns 
(A3.8) and (A3.11) respectively. 


x — vt 

Vl - v 2 jc 2 


and 
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Substituting for A from eqn (A3.15) and for (1 — 1 1 AC) from eqn 
(A3.16) into eqn (A3.12), one obtains 

(t — vxjc 2 ) 


t' = A 




V1 — f 2 /c 2 


(A3.19) 


From eqns (A3.18) and (A3.19), 

V + vx'/c*) 

VI - » 2 /c 2 


t = 


Collecting the transformations. 


v ' _ (* ~ ^ 

X Vl - v 2 \c 2 

y =y 

z! — z 


y(x — vt) 


t' 


(;t — vxjc 2 ) 

V1 - » 2 /c 2 


= y{t — vx]c 2 ) 


These are the Lorentz transformations. The reader can check that 
these transformations do transform eqn (A3.16) into eqn (A3.la). 
Collecting the inverse transformations, 


(V + vt') 

Vi - v 2 tc 2 


= yV + vt') 


y =/ 


(f' + vx'/c 2 ) 

V l - v 2 lc 2 


= y{t' + vx'lc 2 ) 


The Lorentz transformations were derived for a special case only. 
They can be derived more generally using the theory of linear 
transformations. (Reference: Stephenson and Kilmister 2 .) 

( b) Using the Principle of the Constancy of the Limiting Speed of Particles 

The experiment of Bertozzi 3 described in Section 5.4.2 showed 
experimentally that when an electron is accelerated in an electric 
field, the velocity of the electron does not tend to infinity, as pre¬ 
dicted by Newtonian mechanics, but tends to a limiting value of 
3 x 10 s m/s, namely the speed of light in empty space. If all 
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inertial reference frames are equivalent, the numerical value of the 
limiting or ultimate speed should be the same in all inertial frames 
(provided the units of length and time are the same). If the limiting 
speed were not the same in all inertial frames, due to the various 
values of the limiting speed, it would be possible to differentiate 
between the various inertial frames. This result will be called the 
principle of the constancy of the limiting speed for particles. It can 
be used to develop the Lorentz transformations. 

Consider an electron travelling at a speed very close to the 
limiting speed c along the x' axis of S'. Let it pass the origin oi S' 
at t' — 0 and be detected at x at a later time t' in S'. We have 
x'jt' — c. Now the speed of the electron should be greater relative 
to S than S', since S' is moving relative to S along the positive x 
axis. Thus one expects the speed of the electron to be close to c in 
S also. If the event of detection is measured to be at x at a time t 
in S, we have xjt — c. Since S' moves relative to S, x' < x so that 
we must have t' < t. This illustrates in terms of the motion of a 
high energy electron that time cannot be absolute. 

In Figure 3.1, assume that instead of a light pulse, an electron is 
passing the origins of S and S' at t = t' — 0 and is detected at the 
point P having co-ordinates x,y, z, t and x',y', z' and t' in S and S’ 
respectively. Let the speed of the electron be close to the ultimate 
speed c in S'. If its direction of motion is close to the x axis of S', 
its speed should be greater in S than S', since S' is moving with 
uniform velocity v relative to S along the x axis. Hence, the speed 
of the electron should be very close to c in S. Equations (3.1) and 
(3.3) are approximately valid for the motion of the electron from 
the origins of S and S' to the point P in Figure 3.1. One can then 
use the analysis of Appendix 3(a). 

Alternatively, one can develop the expressions for the velocity 
transformations. For the reasons given in Section 3.4 and Appendix 
3(a), assume that the transformations are given by equations 
(A3.4), (A3.2), (A3.3) and (A3.12) respectively, that is 

x' — A(x — vt); y'—y; z' = z; t’ = A(t — Ex) (A3.20) 

where A and E = (1 — l/AC)/r are constants. 

Proceeding as in Section 4.1, 


dx’ - - A(dx — v dt); d t' = A(dt — E dx) 

, _ dx' __ («, — a) 
dr 7 (1 - Eu x ) 


(A3.21) 


Now consider an electron moving with a speed very close to the 
ultimate speed c along the x’ axis of S’, that is u x — c. The electron 
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should move with a speed u x — c in E. Substituting in eqn (A3.21) 


gives 

Similarly, 


E — vjc 2 

, d/ djy u v 

“* = 5F = A{dt - Edx) ~ 4(1 - vujc 2 ) 


(A3.22) 


Assume that an electron moves in E such that u x = 0, u v — c. In E' 
the velocity of the electron should be u! =£* c having components 
«' = —D and u'„, where, since u x = 0, 


Now, 

Since 

Hence, 


u' y = uJA ^ cl A 

u 2 = u'f + «'/ 

u —c and = c/A, 

C 2 =£= Z, 2 + C 2 /d 2 

4 = 1/(1 - t> 2 /c 2 )* 


Substituting for 4 and £ in eqns (A3.20) gives the Lorentz 
transformations. 
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APPENDIX 4 


SPECIAL RELATIVITY VIA MECHANICS 


There is now a large amount of independent experimental evidence 
in favour of relativistic mechanics, so that one can now abandon 
the historical approach to special relativity via the principle of the 
constancy of the speed of light. For example, one can start by 
describing Bucherer’s experimental determination of ejm for fast 
electrons (cf. Section 5.4.4). It can then be shown how these 
results can be interpreted, if it is assumed that the mass of a body 
varies with its velocity u according to the equation 

m — m 0 l( 1 — u 2 /c 2 )^ 

Momentum p and force f can then be defined as 
p = mu — ?w 0 u/(l — u 2 /c 2 ) - 

and 

f = dp/di 


The equation of motion of a charge q of rest mass m 0 in an electric 
field E and a magnetic field B can then be written as 


d / m 0 u 
dflVl — zr/c 2 



-|-{nxB 


(A4.1) 


As in Section 5.4.5 it can be illustrated how this equation has been 
used to design proton synchrotrons capable of accelerating protons 
up to energies of 25 GeV. It should be emphasized that, in addition 
to the variation of mass with velocity, it is assumed in eqn (A4.1) 
that q is independent of the velocity of the charge. Equation (A4.1) 
is only valid if the charge is not radiating. The effects of radiation 
reaction can be illustrated in terms of the design of high energy 
electron synchrotrons. 

The relativistic mechanics of a single particle can now be 
developed precisely as in Section 5.3 leading up to the expressions 


T == (m — m 0 )c 2 
E == me 2 


li 


£2 == pV + my 
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At this stage one can solve the problem of an electron moving in 
an electric field (cf. Section 5.4.1) showing how the speed of the 
electron tends to c. The experimental confirmation of this result is 
described in Section 5.4.2. 

It can be seen from eqn (5.20) that associated with an increase 
in the kinetic energy of a particle of amount A T, there is an increase 
in its inertial mass equal to A.Tjc 2 = A.Ejc 2 . At this stage in 
elementary courses, it is generally postulated that the equivalence 
of mass and energy holds for all forms of energy including the rest 
mass energies of particles. One can then treat binding energy and 
nuclear reactions as described in Section 5.8.1. If it is assumed 
that the laws of conservation of linear momentum and mass-energy 
are valid; it is then possible, using the formulae developed above, 
to cover topics such as pair production, meson decay etc. in the way 
described in Chapter 5. The above theory is adequate for phenom¬ 
ena considered in one reference frame only, for example, the 
laboratory system. If one wants to use the c.m.-system as well as 
the laboratory system, one must develop the relativistic trans¬ 
formations. At this stage the Lorentz transformations could be 
presented as a natural consequence of the extension of the principle 
of relativity to these new laws of mechanics. In practice, it is 
simpler to develop the Lorentz transformations from postulates, 
such as the principle of the constancy of the limiting speed of 
particles (cf. Appendix 3 (b)). Alternatively, one could present the 
principle of the constancy of the speed of light as a special case of 
this postulate and proceed as in Chapter 3. However, even at this 
stage it is not necessary to proceed via the Lorentz transformations. 
If the new laws of high-speed mechanics obey the principle of 
relativity, then the rest mass energy of a particle should be the same 
when the particle is at rest relative to either E or E', so that m 0 c 2 
should be an invariant. Similarly, the inertial rest mass of a particle 
should be the same, whether the particle is at rest relative to either 
E or E', so that m 0 and hence c should be invariants. Using eqn 
(A4.2), we have 

c 2 p 2 - E* = c 2 p' 2 - E ' 2 = -my 

The transformations which satisfy these equations are 

Px = V(Px - vE/c 2 ); p' y = p v ; p' z = pE' = y(E - vp x ) 
y = 1/(1 - v 2 /cy (A4.3) 

where v is the velocity of E' relative to E along their common * axis. 
These transformations can be used to transform from the centre of 
mass to the laboratory system and vice versa. They can be applied 
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to photons to derive the formulae for the Doppler effect in the way 
described in Section 5.8.3. Since 

p x = mu x ; E = mc 2 ; p' x = m'u' x ; E' = m’c 2 etc., 

eqns (A4.3) can be rewritten 

m'u x — ym(u x — t); m'u' v — mu v ; m'u' z = mu z (A4.4) 

m'c 2 = ymc 1 (1 — vujc 2 ) (A4.5) 

Dividing eqns (A4.4) by eqn (A4.5) gives the velocity trans¬ 
formations : 

, _ (u x — v) , _ u v , _ u z 

Ux ~ (1 - vujc *) ’ “» == y(l - vujc•) * “ 2 y(l — 

(A4.6) 

At this stage one can develop the Lorentz transformations from the 
principle of the constancy of the limiting speed of particles in the way 
described in Appendix 3(b). Alternatively, one can compare eqns 
(A3.21) and (A3.22) with eqn (A4.6). One can therefore approach 
special relativity in two ways. In the approach used in the text, 
special relativity is presented as a consequence of the extension of 
the principle of relativity to optics and electromagnetism. Alterna¬ 
tively, one can use the approach outlined in this Appendix, in which 
relativistic mechanics is presented as a necessary consequence of 
experiments on high energy particles such as electrons, and the 
Lorentz transformations as a consequence of the extension of the 
principle of relativity to these new laws of mechanics. 
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THE MOSSBAUER EFFECT 


Emission of y-rays by a Free Nucleus 

Consider first y-ray emission by a free nucleus which decays by 
y-ray emission from an excited state of excitation energy E 0 to the 
ground state, as shown in Figure A5.1(a). If linear momentum is 
conserved in the decay, then the nucleus must recoil with the same 
momentum as the emitted photon, as illustrated in Figure A5.1{b). 
Hence, not all the excitation energy can go into the energy of the 
emitted photon. If E y is the energy of the photon emitted, its 
momentum is Ey/c so that the recoil momentum of the nucleus is 
given by 

Pr = Eylc (A5.1) 

The nucleus is so massive that its recoil velocity is generally very 
much less than the velocity of light, so that the non-relativistic 
formula can be used for its kinetic energy, namely 


T r = p 2 r /2M = (A5.2) 

where M is the mass and T r is the kinetic energy of the recoiling 
nucleus. From the law of conservation of energy 


E 0 — T r + Ey 


(A5.3) 


Ey — E 0 


2Mc 2 


(A5.4) 


Since E y < 2 Me 2 , E y is approximately equal to E 0 , so that eqn 
(A5.4) can be rewritten, to a good approximation, as 


Ey 


E o(' 


J&-) 

2 Me 2 / 


(A5.5) 


The relativistic formula can be calculated using eqn (5.116) 
(cf. problem 5.37). The energy of the photon E y is less than E 0 
by an amount T r , as illustrated in Figure -45.5(a). For two typical 
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cases which will be considered later and whose decay schemes are 
shown in Figure A5.2(a ) and (6) respectively, namely the 129 keV 
line in 191 Ir and the 14-4 keV line in 57 Fe, T r is equal to 0-045 eV 
and 0-002 eV respectively. 

Absorption of y-rays by a Free Nucleus 

Let a y-ray be incident upon a nucleus of the type illustrated in 
Figure A5.1(a), when the nucleus is in the ground state, and let 



(a) (b) 

Figure A5.1. The decay of a free nucleus by y-ray emission 

the y-ray be of the correct energy to excite the nucleus to the excited 
state of energy E 0 above the ground state. If linear momentum 
is conserved in the absorption process, the momentum of the 
recoiling nucleus after absorption will be equal to the momentum 
of the incident photon. The kinetic energy of recoil is again 



(a) (b) 

Figure A5.2. The excited states of ln lr and s, Fe 


approximately equal to ( E^/2Mc «) = T r , so that, in order to have 
a high probability of being absorbed, the incident y-ray must 
have an energy of E 0 + T r ^ E 0 ( 1 + Ff2Mc 2 ) as illustrated in 
Figure A5.3(a). Thus the energy of a photon emitted by a free 
excited nucleus is normally too low by an amount 2 T r to re-excite 
a similar nucleus from the ground state to the excited state. 
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Resonance Fluorescence 

So far it has been assumed that the excited level E 0 above the 
ground state always has the same energy. According to the un¬ 
certainty principle A E A t ~ h, so that, if the lifetime of the excited 


Natural line width 
r- 4-6 xIO' 9 eV 
Ifor the 14-4 keV level 
I in 57 Fe 
\T r is~2x10' 3 eV 


h~ T r 


(E 0 -r r )=energy 
of ay -ray emitted 
by a free nucleus 


b-T r —H 


-Energy 

Eo^excitation £o+r r =energy a y-ray 
energy requires to excite a free nucleus 


(o) 


The line profile 
is broadened 
due to the 
thermal 
Doppler 
effect 



Jf i 

r There is now some 
overlap Of the emission 
and absorption lines 

(f>) 


Energy 


Figure A5.3. In (a), the energy of a y-ray emitted by a free nucleus, and the energy a y-ray 
must have to be absorbed by a free nucleus, are shown, (b) The effect of thermal Doppler 
broadening is illustrated; the emission and absorption lines may overlap in some cases, so 
that nuclear resonance fluorescence is possible 


state is not infinitely long, it must have a finite energy width. 
According to quantum mechanics F the spread (or uncertainty) 
in energy is given by 

F = ; (A5.6) 

where r is the mean lifetime of the excited state, ft is equal to /?/2vr, 
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A being Planck’s constant. For example, for the 129 keV level in 
* *5 and the 14-4 keV level in 57 Fe, F is equal to 4-7 X 10" 6 eV and 
4-57 x 10 9 eV respectively; these values must be compared with 
the values of 0-090 eV and 0-004 eV for 2 T r for the same levels in 
r , arl j ^e. The natural line widths are not sufficient to make 
up the deficit of 2 T r , so that at first sight it would appear that it 
would generally be impossible to observe nuclear resonance fluores¬ 
cence with free atoms unless it were possible, by some method or 
other, to make up for the deficit of 2 T r in the energy of the y-rays. 
It was shown in Section 5.8.3 that when a source of light (or 
y-rays) is moving relative to the laboratory, then the momenta 
and energies of the photons in the laboratory are changed according 
to the momentum and energy transformations of the theory of 
special relativity. (This is the Doppler effect.) If the nucleus, 
which emits a y-ray, is moving relative to the laboratory when it 
decays, then the energy of the photon relative to the laboratory is 
changed, being increased if the photon is emitted in the same 
direction as the direction in which the decaying nucleus is moving. 
This leads to a spread in the energy of the emitted y-rays (the 
Doppler broadening) as illustrated in Figure A5.3(b). In the same 
way, if the absorbing nucleus is moving, then the absorption line 
is also Doppler broadened so that the emission and absorption line 
profiles may sometimes overlap as illustrated in Figure A5.3(b). 
Moon and his collaborators 1 placed a radioactive source on the 
tip of a rotator. They found that, if the rotor was rotated at very 
high speed, the Doppler shift was sometimes enough to make up 
the gap in energy of 2 T r , and they observed nuclear resonance 
fluorescence with the 412 keV level in 195 Hg. Since then other 
techniques have been developed for the study of nuclear resonance 
fluorescence. If the radioactive source is warmed up sufficiently, 
the thermal Doppler broadening may be sufficient for nuclear 
resonance fluorescence to be observed in some cases; this technique 
sometimes works even at room temperatures. Another method 
which has been used, is to observe cases when a y-ray is emitted 
following /3-decay. After emitting a /3-ray the nucleus is recoiling, 
and if the y-ray is emitted whilst the nucleus is still recoiling, the 
y-ray energy is sometimes Doppler broadened sufficiently to give 
rise to nuclear resonance fluorescence. If the absorbing nucleus is 
at rest, the probability for nuclear resonance fluorescence depends 
on the natural line width of the absorbing nucleus, which is related 
to the mean lifetime by eqn (A5.6). Nuclear resonance fluorescence 
techniques have been used extensively in the measurement of the 
lifetimes of the excited states of nuclei. (The reader is referred 

315 


~Pufuc. ~Ph.y.A.LC.A. 


APPENDIX 5 


to Metzger 2 for a review of the methods used and the results 
obtained.) 

The Mossbauer Effect 

If an excited nucleus forms part of the crystal lattice of a solid when 
it decays, then, under certain conditions, the y-ray may some¬ 
times be emitted with no momentum and kinetic energy transfer 
to the nucleus after decay. (For a discussion of the appropriate 
conditions the reader is referred to Burcham 3 and to Lustig 4 . It 
will be adequate for our purposes to accept that it can occur.) 


PI 


* 

Source 


/ 


© Detec tor 


Id 

Absorber 


Figure A5.4. An experimental arrangement for 
observing the Mossbauer effect. For a 67 Co source 
an iron absorber (generally enriched in 57 Fe) is 
used. 


In these cases of recoilless emission, the whole of the crystal 
effectively takes up the recoil momentum so that practically 
all the excitation energy goes into the emitted y-ray in these cases. 
The energy of the y-rays is (for most practical purposes) equal to 
E 0 , the excitation energy and the spread in the energy of the y-rays 
is due almost entirely to the natural line width arising from the 
finite lifetime of the excited state. This amounts to 4-7 X 10~ 6 eV 
for the 129 keV level in 191 Ir and 4-57 x 10 -9 eV for the 14-4 keV 
level in 57 Fe. Not all the decays are recoilless, but with 191 Ir and 
57 Fe the number of recoilless decays amounts to a few per cent or 
more. 

If the y-rays are passed through an absorber, which contains 
the same type of nuclei in the ground state as gave rise to the y-rays 
which were emitted without recoil, as shown in Figure A 5.4, then it 
is possible to have recoilless absorption of the y-rays. The recoil¬ 
less absorption line is also very sharp. Its width is approximately 
equal to the natural line width. This is much sharper than the 
thermal Doppler widths necessary for absorption by free atoms 
namely 4-57 X 10~ 9 eV compared with 0-002 eV for 67 Fe at 300°K. 
If the recoilless emission line and the recoilless absorption lines 
overlap significantly, then there is a strong probability that those 
y-rays which were emitted without recoil will be absorbed without 
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recoil when they pass through the absorber in the experimental 
arrangement shown in Figure A5.4. The nuclear resonance fluores¬ 
cence, under conditions when the emission and the absorption are 
both recoilless, is known as the Mossbauer effect, in honour of its 
discoverer. If the source (or the absorber) in Figure A 5.4 is moved, 
then the energy of the recoilless emission line (or recoilless absorp¬ 
tion line) is shifted due to the Doppler effect, and if the velocity of 
the source (or absorber) is high enough, the recoilless emission 
and absorption lines will no longer overlap significantly and the 


Emission 
tine 


Absorption 

line 




One line is Doppler 
shifted by the natural 
line width r 


4-6 x 10' 9 eV for the 
14-4 keV level of 57 Fe 


-The amount of overlap is 
decreased substantially 


(a) 


(b) 


Figure A5.5. The Doppler shift of the emission and absorption recoilless lines reduces the 

Mossbauer effect 


absorption of the y-rays from the source decreases since the Moss¬ 
bauer effect is no longer present, so that the detectors in Figure 
A5.4 records a higher counting rate. Mossbauer found that for 
the 129 keV line in 191 Ir it was only necessary to move the source 
at a speed of 1 -5 cm/sec to shift the recoilless emission line by its 
natural line width. This may be contrasted with the enormous 
speeds required to make up 2 T r in the case of y-ray emission by 
free nuclei with recoil. The Mossbauer effect absorption cross- 
section depends critically on the amount of overlap of the recoilless 
emission and recoilless absorption lines as illustrated in Figure 
A5.5. It can be seen that small shifts in the energy (or frequency) 
of either the recoilless emission or absorption lines changes the 
probability for the Mossbauer effect substantially and produces 
measurable effects. Measurements of the Mossbauer effect are an 
accurate method of measuring differences or changes in frequency. 
For example with the 14-4 keV line in 67 Fe the natural line width 
is 4-57 X 10 9 eV and it is possible to measure differences of energy 
(and frequency) less than the natural line width, so that changes in 
energy (and frequency) of the order of 1 part in 10 12 can be detected. 
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The energy of a 14-4 keV y-ray corresponds to a frequency of 3-46 x 
10 18 c/sec. The method has proved accurate enough to be used in 
the study of the gravitational shift of spectral lines. 

In his original experiments, Mossbauer used the 129 keV y-ray 
from 191 Ir. The recoilless resonant absorption amounted to 
approximately 1 per cent of the total absorption cross-section at 
liquid nitrogen temperatures, most of the emission being with 
recoil. After Mossbauer’s discovery a search was made for other 
substances which could be used, and many more have since been 
found. The substance which has been used most extensively is 
67 Fe. A source of 67 Co is generally prepared by deuteron bombard¬ 
ment of an iron target (which contains 58 Fe) in the form of a strip, 
the reaction being 

56 Fe + 2 H-> 57 Co + n 

The 57 Co is produced near the surface of the iron strip. The S7 Co 
decays by electron capture to excited states of 57 Fe and a substantial 
fraction of the decays go via the 14-4 keV level in 67 Fe, which has a 
mean lifetime of T4 X 10~ 7 sec, as illustrated in Figure A5.2(b). 
The lifetime of the 14-4 keV state is long enough for the excited 
67 Fe ions to occupy suitable sites in the iron crystal lattice before 
they decay. It is then possible for them to decay without recoil. 
The 14-4 keV y-rays can be passed through an iron absorber which 
can be enriched in 57 Fe so as to increase the probability of recoil¬ 
less absorption. The y-rays can be detected by a proportional counter 
or scintillation counter. The probability of recoilless resonance 
fluorescence is increased in this way and is substantially higher than 
in 191 Ir, and under favourable conditions may amount to 10 per 
cent or more of the total absorption. With 67 Fe the source and 
absorber can also be used at room temperatures which is a great 
advantage. 

For a fuller review of the Mossbauer effect the interested reader 
is referred to the articles by Burcham 8 and Lustig 4 . 
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RADAR METHODS AND THE ^-CALCULUS 


(a) Introduction 

Some readers may find it simpler to visualize the determination 
of the co-ordinates and times of distant events, in terms of the radar 
methods described in Section 3.9, rather than use all the para¬ 
phernalia of imaginary rulers and synchronized clocks distributed 
throughout space, so that observers, distributed throughout space, 
can record the co-ordinates and times of events, when and where 
they happen. Radar methods will be used in this Appendix to 
develop the Lorentz transformations and to illustrate time dilation 
and length contraction. The method used is known as the K- 
calculus. The pioneer of this method was Bondi 12 , whose approach 
we shall follow. Before proceeding with this Appendix, the reader 
should familiarize himself with Section 3.9 of Chapter 3. 

It will be assumed that there is a radar station at rest on the earth 
(the laboratory system 27), which can transmit and receive back 
radar pulses, which travel with the speed of light. Assume that 
there is an observer called John in this radar station. Assume that 
there is also a clock in the radar station on the earth, which can 
record the times of transmission and receipt of radar signals. It 
will be assumed that the radar stadon and observer John are at 
rest at the origin of the inertial frame 27. 

Consider a rocket moving directly away from the earth with uniform 
velocity v, with a lady astronaut, called Mary, on board. Let the 
rocket be at rest at the origin of the inertial frame 27'. Identify 
the common x axis of 27 and 27' with the direction of motion of 
Mary s rocket relative to the earth. It will be assumed that there 
is a radar set on the rocket, which can transmit and receive radar 
signals. It will be assumed that Mary has a clock on the rocket, 
of identical construction to John’s clock. [Some readers may prefer 
to consider both John and Mary in moving rockets, as shown in 
Figures 3.4, 3.5 and 3.6 of Sections 3.6 and 3.7. For example, John 
could be on rocket 1 and Mary on rocket 2.] 

Let John send out a radar signal from the earth at a time t z on 
his clock. Let the radar signal be reflected at a distant event on 
the x axis, and let the reflected radar signal reach the earth at a 
time t a on John’s clock. Following the analysis of Section 3.9, if 
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John assumes that the speed of radar signals (or light) is the same 
and equal to c in all directions, John will determine the following 
position (#) and time (£) for the distant event: 

* = ^ (* a — h ); 1 ~ + h) (A6.1) 

These equations will be used with different symbols for x, t, fj. and 
t z throughout this Appendix to measure the co-ordinates and times 
of distant events relative to John’s reference frame, the laboratory 
system E. 

Similarly, let Mary send out a radar signal from her rocket at 
a time t[ on her clock. Let the signal be reflected from the event 
on the x' axis, and let the reflected signal reach Mary s rocket at 
a time i' on Mary’s clock. Mary will then determine the following 
position (*') and time (t') for the event: 

*' = £( 4 - 4 ); t = |(4 + 0 ( A6 - 2 ) 

These equations will be used throughout this Appendix to determine 
the positions and times of events relative to Mary’s rocket. Notice 
the same value used for c in eqns (A6.1) and (A6.2), so as to be in 
accord with the principle of the constancy of the speed of light 
which is taken as axiomatic in this Appendix. Rather than use radar 
or light signals, some readers may prefer to use science fiction space 
guns to send messages. These space guns would shoot electrons at 
such a high speed, that their speed relative to both John and Mary 
would always be very, very close to c, and the analysis using radar 
methods in this Appendix would still be applicable. 

(6) Relative Speed of the Earth and Rocket 

Let John send out a series of radar signals from the earth at 
known times on his clock. Let these signals be reflected by Mary’s 
rocket, and let John determine, on his clock, the times at which he 
receives these reflected signals. Using eqn (A6.1), John can get a 
series of values for the positions of Mary’s rocket at various times, 
and hence calculate the speed of Mary’s rocket relative to the earth. 
Similarly, by transmitting radar signals, which are reflected by the 
earth, Mary can determine the speed of recession of the earth 
relative to her rocket. 

John and Mary must agree on their speed of separation. If they 
did not obtain the same value for their relative speed, it could only 
be due to motion in one direction of space compared with motion 
in the opposite direction of space. This would be contrary to our 
assumptions that space is isotropic, and that the speed of radar 
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signals (light) is the same in all directions in all inertial frames. 
Hence, John and Mary must agree on their speed of separation 
which will be denoted by v. 

(c) The K-Calculus 

A space-time diagram, or displacement-time graph will be used 
to represent the positions and times of events relative to John’s 
reference frame (27), which is the laboratory system. As described 
in Section 3.9, it is conventional to plot m = ct against x, as shown 
in Figure A6.1(a), where * is the position and t is the time of the 
event, or world point, measured using John’s radar set. The dis¬ 
placement of Mary’s rocket relative to the earth is shown in Figure 
A6.1(a). If Mary and John set both clocks to zero (that is t = t' = 0), 

John’s Mary’s James' 
world world world line 

John’s 
world 
line 

(Earth > Mary’s 

m -ff • . . . 


Figure A6.1. (a) John, who is at rest on the earth, transmits radar signals from the 
earth at the times 0, T 0 , 2 T„, etc. on his clock. The world lines of the signals are 
shown dotted in Figure A6.1(a). They are at 45 degrees to the x and m - ct axes. 
Mary is m a rocket moving away from the earth with uniform velocity v relative to 
John. John’s radar signals reach Mary at times 0, T’, IT', etc. on Mary's clock. 
The signals reflected by Mary’s rocket return to the earth at times 0, T, IT, etc. on 
John’s clock. Later, in Section ( j), we shall consider a third inertial observer, James, 
in a rocket at rest relative to the earth. James receives John's radar signals at time 
intervals of T 0 on his clock, (b) An enlargement of part of Figure A6.1(a), showing 
the radar signal John transmits at T„ on his clock, which is reflected by Mary’s rocket 
at a time T’ on Mary's clock, and which returns to earth at a time T on John’s clock. It 
is shown that T’ = KT„ and T =-- KT’ = A a r o , where K equals (I + v/e)*/ 

(l - »/«)* 

321 


~PuJnc. Tfogt/tcwuitica/ ~Ph.y-A.LCLA. 





APPENDIX 6 


when Mary’s rocket passes directly over John’s radar station on 
the earth, then Mary’s rocket goes a distance * = vt == {vjc)ct in a 
time t relative to John. Hence Mary’s world line Om is at an angle 
cf> = tan -1 (w/c) to the m — ct axis in Figure A6.1(a). Throughout 
this Appendix it will be assumed that world lines and world points 
are plotted relative to John’s reference frame S, using the Ox and 
Om axes, though we shall take the liberty of labelling the times of 
events on Mary’s world line, Om', as the times measured by Mary’s 
clock. 

Let John send out radar signals at times t = 0, T 0 , 2 T 0 etc., as 
measured by John’s clock at rest on the earth. The world lines of 
these radar signals are at 45 degrees to the x and m — ct axes in 
Figure A6.1{a), since in a time A t light travels a distance A x = c A t. 
Let these radar signals be received by Mary’s radar set at times 
0, r, 2 V, etc., as measured by Mary’s clock on the rocket. These 
time intervals must be equally spaced, if Mary is leaving the earth 
with uniform velocity v. Let 

T' = KT 0 (A6.3) 

where T 0 is measured on John’s clock on the earth, and T‘ by Mary’s 
clock on the rocket. Now, assume that, whenever Mary receives a 
signal from the earth, she sends a radio message back to the earth, 
without time delay, at times 0, T’, 2 T', etc., on Mary’s clock. 
These could be the reflections of John’s signals. The world lines 
of these signals are also shown dotted in Figure A6.1(a). Since the 
earth is moving away from Mary’s rocket with uniform velocity v, 
the signals reaching John on the earth, should also be equally 
spaced, say at times 0, T, 2 T, etc. Let 

T= K'T' 

where Tis measured by John’s clock on the earth and T' by Mary’s 
clock on the rocket. According to the postulates of special relativity, 
all directions in space are equivalent and the speed of light in 
empty space is the same in all directions in all inertial reference 
frames, so that there is nothing to differentiate John from Mary. 
They are just moving apart with uniform velocity v relative to each 
other in empty space. If Mary measures the interval between 
John’s equally spaced signals to be lengthened by a factor K, when 
she is going away from John with uniform velocity v, then John 
should measure the time interval between Mary’s equally spaced 
signals to be increased by the same factor when John is moving 
away from Mary with the same uniform velocity o. Hence, we 
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conclude that K = K', so that using eqn (A6.3) 


T — KT' = K 2 T 0 (A6.4) 

Consider the radar pulse which John transmits from the earth at 
the time T 0 , on his clock, and which is reflected by Mary’s rocket 
at time T on Mary’s clock, and which arrives back on the earth 
at a time T on John’s clock, as illustrated in Figure A6.1(b). Treating 
the time between t — 0 and t = T 0 on John’s clock on the earth as 
one interval, we can apply eqns (A6.3) and (A6.4) to these times. 
Hence, provided Mary’s rocket leaves the earth at t = t' = 0 and 
travels directly away from the earth with uniform velocity, so that 
Mary s world line is a straight line through the origin, as shown in 
Figures A6.1(b) then 


T' = KT 0 
T = KT' = F?T n 


(A6.5) 

(A6.6) 


These are the basic formulae of the A^-calculus. The quantities T n , 
T and T are illustrated in Figure A6.1(b). Summarizing , provided 
Mary’s world line goes through the origin, if John transmits a radar 
signal at a time T 0 measured by using his clock on the earth, this 
signal reaches Mary’s rocket at a time T' = KT 0 on Mary’s clock, 
and after reflection by the rocket this signal reaches the earth at a 
time T = K 2 T 0 measured on John’s clock. If Mary transmits a 
radar signal from the rocket at a time T', on her clock, the signal 
reaches the earth at a time T = KT’ measured on John’s clock on 
the earth. The reader should become thoroughly familiar with 
eqns (A6.5) and A(6.6) and learn to recognize them when they are 
used with different symbols for T 0 , T' and T in this Appendix. 
The value of the constant K will now be determined. 

Consider the radar signal transmitted from the earth at a time 
T a on John’s clock and received back after reflection from Mary’s 
rocket at a time T on John’s clock. Using eqn (A6.1), John finds 
the following position for Mary’s rocket 

* = |(T-r 0 ]; f = £[r+r 0 ] 


If Mary’s rocket is moving at uniform speed v relative to the earth, 
since the rocket leaves the origin (x = 0) at the time t = 0, 


v 

c 


X T-T 0 
ct T + T 0 
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Using eqns (A6.5) and (A6.6), 


v K % T 0 -T 0 K 2 - 1 

'c ~ K 2 T 0 + T 0 X 2 + 1 

Rearranging, and taking the square root, we find 


(A6.7) 


K = 


1 -f o/c 
1 - vjc 


(A6.8) 


This determines the factor K. It is illustrated in Section (j) of this 
Appendix that eqn (A6.5) is an example of the Doppler effect. 
Two important expressions involving K will now be calculated 


K , 1 = li+jt 4 . I- _ ?£ 

A + K V 1 - v/c + V 1 + o/c 

(1 + v/c) + (1 - vjc) 


Hence, 

1 

K 2 + 1 

Vl - V 2 /c 2 

2 


k + k = 

K 

Vl - v 2 lc 2 

Similarly, 

1 

K 2 - 1 

2 ojc 


K ~ K 

“ K 

II 

< 

1 ■ 
'to 

where 


y = ( 1 



(A6.9) 


(A6.10) 

(A6.ll) 


(d) The Lorentz Transformations 

It will be assumed that John is at the origin of 27, that Mary’s 
rocket is at the origin of 27', and that Mary’s rocket passes John’s 
radar station on the earth at t = t' = 0. Let an event occur at a 
point on the x axis, at a point further away from the earth than Mary’s 
rocket. Let John transmit a radar signal at a time (i xjc), 
measured on his clock. Let this signal be reflected at the event, and 
be received back on the earth at a time {t - j- xjc) on John’s clock as 
illustrated graphically in Figure A6.2[a). Using eqns (A6.1), John 
estimates the position of the event as x at a time t. Let the radar 
pulse pass Mary’s rocket at a time (f' — x'jc) on Mary’s clock, a.nd 
let the signal reflected by the event pass Mary at a time {f + x'jc) 
on Mary’s clock, so that, using eqn (A6.2), Mary calculates the 
position of the event as x' at a time f'. 
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Figure A6.2. (a) Lorentz transformations. John transmits a radar signal, shown dotted in 
rigure Ab.2{a), at a time (t — xfl) on his clock. After reflection at a distant event the signal 
returns to the earth at a time (t + xjc) on John's clock. The radar signal passes Mary at 
times (t — x /c) and (t' + x'/c) on Mary's clock, (b) Time dilation. Two events occur on 
Mary s rocket at times t{ and ti, measured on Mary's clock. John transmits radar signals, at 
times 1 3 and £ 4 on his clock which, after reflection at events 1 and 2 respectively, return to the 
earth at times f 5 and t % respectively, measured on John's clock 

Treating the time from 0 to (t — x/c) on John’s clock as T 0 and 
the time from 0 to (/ — x'/c] on Mary’s clock as T', we have from 
eqn (A6.5), 

ct' — x’ = K(ct — x) = Kct — Kx (A6.12) 

Treating the interval between 0 and ( t’ + x'/c) on Mary’s clock as 
T' and the interval between 0 and (t + xjc) on John’s clock as T, 
from eqn (A6.6), we have 

ct + x = K(cf + x') 
or 

/ , / ct x 

cf +* , =K + j t (A6.13) 

Subtracting eqn (A6.12) from eqn (A6.13) gives 

2 *' = «(i + *) - “{* - i) 
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Substituting for (K + 1/X) and (K — 1/ff) using eqns (A6.9) and 
(A6.10) respectively, gives 

x' = y{x — vt) (A6.14) 

where 

y = (1 — z) 2 /c 2 ) _i (A6.ll) 

Adding eqns (A6.12) and (A6.13) gives 

2ci' = ct{^ + Kj - x(k - ^ 

Using eqns (A6.9) and (A6.10), we obtain 

t’ = y(t — vx/c 2 ) (A6.15) 

Equations (A6.14) and (A6.15) are the Lorentz transformations, 
where x and x r are the positions and t and t are the times of the 
event measured by John and Mary respectively using radar methods. 
When using eqns (A6.1) and (A6.2), it was assumed that the speed 
of light was equal to c in all directions of empty space, relative to 
both John and Mary. This is the principle of the constancy of the speed of 
light, so that the present derivation of the Lorentz transformation is 
based on precisely the same axioms as the development in Section 
3.4. In the present case, it is easier to visualize how the co-ordinates 
and time of the distant event are determined. Solving eqns (A6.14) 
and (A6.15) for x and t gives the inverse transformations, 

x = y(x' + vt ’); t = y(t’ + vx’lc 2 ) 

So far only the measurement of events in the direction of relative 
motion has been discussed. The example used in Sections 3.6 and 
3.7 of Chapter 3, and illustrated in Figure 3.6, is in the spirit of the 
approach adopted in this Appendix, since light signals were used to 
send information from the events in Figure 3.6 to two observers C 
and C’ in uniform relative motion. For example, one could put 
John in rocket 1 and identify him with C, and put Mary in rocket 2 
and identify her with C’ in Figure 3.6, and then use the argument of 
Section 3.7 to show that 

y =/ (A6.16) 

z = z' (A6.17) 

This completes the Lorentz transformations. After developing the 
Lorentz transformations, we could now use them to develop the 
expressions for time dilation, length contraction etc., in the algebraic 
way described in Section 3.5. For the benefit of readers, who 
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prefer to use the geometric methods of the A-calculus, the expressions 
for time dilation and length contraction will now be developed. 
Without using the Lorentz transformations. The principle of the 
constancy of the speed of light will, however, be taken as axiomatic. 

(e) Time Dilation 

Let two events 1 and 2 happen on Mary’s rocket at times t[ and t' 2 
respectively, as measured by Mary’s clock on the rocket. Let John 
be fortunate enough to send a radar signal from the earth at a time 
f 3 , on his clock, such that it reflected from Mary’s rocket at the time 
of the first event on her rocket, and let this reflected signal reach 
John at a time t s on John’s clock, as illustrated graphically in 
Figure A6.2(b). Let John send a second radar pulse at a time on 
his clock, such that it is reflected from Mary’s rocket at the time of 
event 2, and returns to John’s radar station on the earth at a time 
t e on John’s clock as illustrated graphically on Figure A6.2(b). 
Mary estimates the time interval between the events as 

to' = /'-*' 

Using equations similar to eqns (A6.1), John estimates the time 
between the two events as 

to = i(t s + f 4 ) - |(* 5 + t 3 ) (A6.18) 

Using eqns (A6.5) and (A6.6), we have 

h Kt 3 = tJK 
t 2 — Kt t = t e /K 

Substituting in eqn (A6.18) for t 3 , / 4 , t 6 and t 9 gives 

Using eqn (A6.9) gives 

a At' 

VI — v 2 /c 2 v ' 

This is an example of time dilation (cf. Sections 3.5.2 and 3.10). 
The time interval measured by Mary’s clock is a proper time 
interval, since both events happen on the rocket. 
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(/) Length Contraction 

Consider the measurement of the length of Mary s rocket by both 
John and Mary. The world lines of the ends of Mary’s rocket are 
illustrated in Figure A6.3{a). Let John transmit a radar signal at a 
time t u on his clock, such that it is reflected from the front of Mary s 


John’s 
world line 



Figure A6.3. (a) Length contraction. The world lines of the front and rear of Mary's 
rocket are shown. John transmits radar signals at times f x and i, on his clock. These 
signals are reflected at the front and rear of Mary’s rocket at world points Q and P 
respectively. Using radar methods , John calculates P and Q to be simultaneous, (b) 
Minkowski diagram. It is suggested that Ox' and Om' could be used as oblique axes 
to represent the co-ordinates and times of events relative to Mary s rocket 

rocket at the world-point Q in Figure A6.3(a), and returns to the 
earth at a time t 4 , on John’s clock, as illustrated graphically in Figure 
A 6.3(a). John estimates the position of the front of Mary’s rocket as 

x q = C ~2 (h — h) > l Q — i(*4 + h) (A6.20) 

Let John transmit a second radar signal at a time t 2 on John s clock, 
such that it is reflected at the rear of Mary’s rocket at the world- 
point P in Figure A6.3(a) and returns to the earth at a time t 3 on 
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Johns clock, as illustrated in Figure A6.3(a). John estimates the 
position of the rear of Mary’s rocket as 

x p ~ 2 (*8 h)> tp = + q) (A6.21) 

Now, in order to measure the length of the rocket, John must 
measure the positions of the front and rear of Mary’s rocket at the 
same time, as measured by John’s radar experiments. Hence, t Q 
must equal tp at the world points P and Q, as shown in Figure 
A6.3{a). Hence, we must have 

U + h = *a + h (A6.22) 

Subtracting eqn (A6.21) from eqn (A6.20) and using eqn (A6.22), 
John concludes that the length of the moving rocket is 

l = Xq Xp c(q q)/2 c(q — q)/2 

l — e(q q) = e(q q) (A6.23) 

Let Mary stand at the rear of her rocket. According to eqn (A6.5), 
the radar signal, transmitted by John at a time q on his clock passes 
the rear of Mary’s rocket at a time Kt 1 on Mary’s clock. According 
to eqn (A6.6), after reflection from the front of Mary’s rocket at the 
world point Q in Figure A6.3(a), this radar signal passes Mary at the 
rear of her rocket at a time tJK on her clock. Using eqn (A6.2), 
Mary therefore concludes that the distance from the rear to the 
front of her rocket is 

l ° := 2 (i - Kt 

Rearranging, 

2^o ™ 

< 4 = ~ + K% (A6.24) 

From eqn (A6.6), we obtain in the present case, 

*s = K % (A6.25) 

Subtracting eqn (A6.25) from eqn (A6.24) gives 

mt, - q) 

From eqn (A6.23), both (q — t z ) and (q — q) are equal to Ijc. 
Hence, 

l == 2 Kl 0 - KH 
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Rearranging, and using eqn (A6.9), gives 

= (A6.26) 

This is the expression for the Lorentz contraction. John measures 
the length of Mary’s rocket to be less than Mary does. 

(g) Simultaneity of Spatially Separated Events 

Consider the two events of John’s measuring the extremities of 
Mary’s rocket at the world points P and Q in Figure A6.3(a). 
Relative to John’s reference frame 27, these events are simultaneous. 
Mary records the event at P at a time t' P , where according to eqns 
(A6.5) and (A6.6), 

t’ F = = | = \{Kt 2 + |) (A6.27) 

According to eqn (A6.5), the signal which John transmitted at the 
time t 1 on his clock, passes the rear of Mary’s rocket at a time Kt x 
on Mary’s clock, whereas according to eqn (A6.6), the signal, 
reflected from the front of Mary’s rocket at the world point Q, 
passes the rear of Mary’s rocket at a time tJK on Mary’s clock. 
Mary concludes that the time of the event at the world point Q, is 

<i - + |) (A6-28) 

Since from eqn (A6.22), (t 2 — t x ) equals (t i f 3 ), subtracting eqn 
(A6.28) from eqn (A6.27) gives 

tp-t’ Q = iK(h - h) + Wz ~ *«)/* 

= h ) (A6.29) 

Subtracting eqn (A6.21) from eqn (A6.20) and using eqn (A6.22) 
gives 

x q — x P = e(/ 4 — ^i)/2 c(t 3 t 2 )/2 

= c(t t — t 3 )/2 + c(t 2 ^i)/2 = c(t 2 fjJ 

Substituting ( x Q — x p )jc for t 2 — t x and for (k — from eqn 
(A6.10) into eqn (A6.29) gives, ' 

t'p — t' Q = yv{x Q - x P )jc 2 

This is in agreement with the Lorentz transformations, (cf. Section 
3.6). Thus, according to eqns (A6.27) and (A6.28), Mary does not 
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record the events at P and Q as simultaneous. Thus simultaneity is 
not an absolute property, as it was assumed to be in Newtonian 
mechanics. Mary concludes that John did not measure the ends 
of her rocket at the same time on her reckoning, (cf. Sections 
3.5.1, 3.6 and 3.7 of Chapter 3). The reader should remember that 
we took the principle of the constancy of the speed of light as 
axiomatic, when applying eqns (A6.1) and (A6.2). 

(A) Minkowski Diagrams 

In Figures A6.1, A6.2 and A6.3, the space-time or displacement¬ 
time diagrams always represented the co-ordinates and times of 
events relative to the earth, that is John’s reference frame 27. Now 
the world line of Mary’s rocket corresponds to the motion of the 
origin of 27 relative to 27, that is the line x' — 0. Hence, it might 
be possible to use the line Om' in Figure A6.3(b) as a time axis to 
represent the times of events relative to Mary’s reference frame 27'. 

Consider an event at the world point Q in Figure A6.3(b). Let 
John transmit a radar signal at a time t x on his clock, which after 
passing Mary’s rocket at the world point N, in Figure A6.3(b ) is 
reflected at the event at the world point Q and after passing Mary’s 
rocket at the world point S reaches the earth at the time t z on John’s 
clock. If the line Om' were used as a time axis for Mary’s reference 
frame, the time, corresponding to the event at Q in Figure A6.3{b), 
must be at the point R on the Om' axis, which is half-way between 
the points S and N, if Mary uses eqn (A6.2) to calculate the time of 
the event at Q. Thus events along the line RQ in Figure A6.3(b) 
must be measured to be simultaneous relative to Mary’s rocket, 
if Om is used as Mary’s time axis. The line corresponding to t' = 0, 
should be a line through the origin 0 parallel to RQ. Hence it 
might be possible to use Ox', which corresponds to t' = O, as the 
x' axis to represent the co-ordinates of events relative to Mary’s 
rocket (27'). If the world line of Mary’s rocket, that is the line Om’, 
makes an angle <j> to the m — ct axis in Figure A6.3(b), then, since 
radar signals are at an angle of n/4 relative to both the x and m = ct 
axis in Figure A6.3{b), the angles RNQ and RSQ are (tt/4 - $) and 
(tt/4 + <j>) respectively. If the angle PQR is a, then the angles 
SQR and RQN are (tt/4 -j- a) and (ir/ 4 — a) respectively. Hence 
in the triangle QRN, 


NR 

m 



(A6.30) 
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In the triangle RQS 



Since NR = RS, equating the right-hand sides of eqns (A6.30) and 
(A6.31) shows that a = </>. Hence RQ, and hence Ox , are inclined 
at the same angle <j> to the Ox axis as the Om' axis is inclined to the 
Om axis in Figure A6.3{b). Thus Ox' and Om can be used as oblique 
axes to represent the co-ordinates of events, relative to Mary’s 
rocket (27'). It is shown in Chapter 6, that, if we choose the lengths 
of the units of x' and m! = ct' appropriately, that is using eqns (6.5) 
and (6.6) of Chapter 6, then one can use the same world points and 
the same world lines relative to Mary’s rocket (27 ), that is the 
oblique Ox', Om' axes, as relative to John’s reference frame 27, that 
is the rectangular Ox, Om axes. The world points such as Q and 
world lines are then invariants. For example, the position of the 
point Q need not be changed relative to the Ox', Om' axes. The more 
adventurous reader may prefer to proceed to Chapter 6 for a full 
discussion of Minkowski diagrams in Section 6.2, and may like to 
develop the transformations of the theory of special relativity using 
the more elegant 4-vector methods in Section 6.4, before returning 
to consider the applications of these transformations in Chapters 3, 
4 and 5. 


(i) Velocity Transformations 

Consider a rocket moving along the * axis, away from both John 
and Mary, with astronaut Peter abroad. Let Peter’s speed relative 
to Mary be w, as measured by Mary using radar methods. Let 
Peter’s speed relative to John, who is at rest on the earth, be u as 
measured by John using radar methods. Let 



If John transmits radar signals at intervals T 0 on his clock, according 
to eqn (A6.5), they are received at time intervals ofF^X 1 ,, by Mary, 
as measured by her clock, and time intervals of K 3 T 0 by Peter, as 
measured by his clock. Now, if Mary transmits signals at intervals 
of KfT 0 , on her clock, whenever she receives one from John, 
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according to eqn (A6.5), Peter should receive Mary’s signals at 
intervals of K. 1 K 1 T 0 . However, the interval measured by Peter 
should still be equal to K S T 0 , just as if Mary had not received 
signals and re-transmitted them without time delay. Hence, 


K 3 T 0 = K 2 K 1 T 0 , or = 



Solving for u gives 



This equation is consistent with the velocity transformations 
developed in Section 4.1. 

(j) The Doppler Effect 

Consider again the example illustrated in Figure A6.1(a). Let 
v 0 (= 1 /T 0 ) be the frequency of signals transmitted by John from 
the earth, as measured by John’s clock. Let v' (= 1/7") be the 
frequency of the signals received by Mary on the moving rocket, as 
measured by Mary’s clock. Since by definition, that is eqn (A6.3), 
T' equals KT, we have 

v '=jg (A6.32) 

or, 

F-l 

v ' z= v ° / —l ( A6 - 33 ) 

One can treat each successive maximum of a plane monochromatic 
wave as a signal. Hence eqn (A6.33) should hold for light. Equation 
(A6.33) is the relativistic expression for the Doppler effect, for the 
case when the source and observer are moving directly away from 
each other with uniform relative velocity v. (Compare Sections 
3.10.2, 4.4.4 and 5.8.3.) When the source is going directly away 
from the observer the frequency of the light goes down. 
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Let astronaut James be in a rocket which is at rest relative to the 
earth. Let him be further away from the earth than Mary’s rocket. 
James’ world is shown in Figure A6.1(a). It is parallel to the m — ct 
axis. Consider the radar signals, which John transmits at times 0, 
T 0 , 2 To etc., on John’s clock. Since James is at rest relative to John, 
the time intervals between the successive signals, received by James 
must be equal to T 0 . Let Mary transmit signals, whenever she 
receives one from John at times 0, T', 2 T' on her clock, where 
T' = KT 0 etc. These signals should also reach James at intervals 
of T 0 , just as if there had been no intermediate stage, when Mary 
received and re-transmitted radar signals. Hence, if Mary transmits 
signals of frequency v' 0 (=l/T') measured by her clock, the frequency 
of signals reaching James must be v = (1/T 0 ), when Mary is 
approaching James. Thus if the source is approaching the observer 
directly with uniform velocity v, 



In this case the frequency goes up by a factor K. Notice, if the 
frequency of light from a receding source goes down from v 0 to 
vJK, the frequency of light from the same light source, when the 
source is approaching the observer with the same uniform velocity 
goes up from v 0 to Kv 0 . This fact is used in the gedanken exfierimente 
in Section 8.2. 


REFERENCES 

1 Bondi, H. Discovery 18 (1957) 505 

2 Bondi, H. Relativity and Common Sense. 1965. London; Heinemann Ltd. 
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ANSWERS TO PROBLEMS 


3.3 

3.4 

3.5 
3.8 


3.10 

3.11 
3.13 

3.15 

3.16 

3.17 

3.19 

4.1 

4.2 

4.3 
4.5 


4.6 


4.7 

4.13 

4.14 

4.15 

4.16 
4.18 


y y r r ( COs2 )* 

In 27, L = L 0 -1- sin 2 0'j ; tan 0 = y tan 0'. 

1/(2 X 10 8 ) of its diameter or .—6-37 cm. 

0-866 c. 


(a) The rear of the rocket is 3 X 10 10 m from the earth when 
the light reaches it, 100 sec after the time of emission- 

(b) 0-9 r. 

(a) 1-36 km; (b) 4-6 km; ( c) 14-5 km. 


v = 0-98 c. 

(a) A = -573 A; (b) -2,536 A. 
{a) 0-8 c; ( b) 0-89 c; (c) 0-65 c. 
(a)v = 0-8c; (b) 3 sec. 

Z 2 ‘ 




(a) u x — 45(1 5 x 10 18 ) m/sec; ( b ) u = 30 m/sec, u,. — 

15(1 - 5 X 10-1®) m/sec. V 

(a) u x = 0-83 c, u y = 0; (b) u x = 0-1 c, u y = 0-796 c; (c) 
u x = 0, «„ = 0-798 c; ( d ) 97°1 T to the direction of motion. 
«„ = 0-82 f, u x = 0-395 c. 

For the first case: 

2 _ «' 2 -f g 2 + 2 u'v cos q' — (u'v sin q') 2 /c 2 

[1 + (u’v/c 2 )cos a'] 2 

j VJT— v 2 /c 2 ) u! sin a' 

and tan a = — - - '—f - 

u cos a + v 

The expression for u is symmetrical in v and u\ but tan a 
is not. 


(a) 0-87 c; ( b) 40 stone; (c) She would not appear con¬ 
tracted, but merely rotated relative to the laboratory. The 
colour of her clothes, hair and complexion would change 
(cf. Section 4.4.5). 

(a) 1-6 c; (b) 0-8 c; (c) 0-976 c. 

0-69. 

AA = +6-56 A. 

10-6 m. 

75° 30'. 

(a) 4-24 x 10 18 , 3-76 x 10 18 , 3-39 x 10 18 , 3-08 x 10 18 , 
2-73 x 10 18 c/sec; (b) 1-0: 0-79: 0-64: 0-53: 0-44. 
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4.20 

5.3 

5.4 

5.6 

5.7 

5.12 

5.13 

5.14 

5.15 

5.16 

5.20 

5.21 

5.22 
5.24 

5.26 

5.27 

5.32 

5.33 

5.35 

5.36 
5.40 
6.5 

6.12 

8.4 

8.5 

8.6 

8.7 

8.8 


8.9 

9.2 

9.3 

9.4 


ANSWERS TO PROBLEMS 


(a) AX = 1,333 A; (b) AX = — +840 A; (c) ^53|°. 
B(=vle) is (a) 0-196; ( b ) 0-549; (c) 0-94. 
m/m 0 is (a) 1-02; ( b) 1-197; ( c ) 2-97. 

B(= vjc) is (a) 0-145; (6)0-42; (c) 0-87. 

(a) 0-0536 cm; (b) 0-227 cm; (c) 1-19 cm; (d) 3-80 cm. 
31-5 cm. 

(a) 36; (6) —2 X 10" 18 . 

(a) 34 days; (4) 81 h; (c) 300 sec. /xi1 . ... 

(a) 0-079 MeV; (b) 0-261 MeV. m/m 0 = (a) 1-15; (6) 
1-51. 


(a) 2-56 X 10 5 V; (6) 0-75 c. 

1- 4 X 10- 12 . 

28 MeV. 

(a) 17-3 MeV; (6) -1-1 MeV. 

\a) 4-2 X 10 9 kg/sec; (6) 4-5 X lO" 6 N/m 2 . 

144 MeV/c 2 or 282 electron masses. 

= 277 m„ m 2 = 1840 m e ; {a) —2,380 m e ; ( b ) —500 
MeV/e; (c) —97 MeV. 

£max = 52-8 MeV. 
u cm = 0-138 r; l-0095r. 

129 MeV/c; 228 MeV. 

0-511 MeV. 

(a) 145 MeV; (6) 171 MeV; (c) 5-6 GeV. 

2- 4 X 10 _1 ° sec. 


(a) — c 2 ; (6) -mJA 

(a) 4c/5; (6) 10 years; (c) <lab = 20 years, /rocket 


12 


years. 

v = 0-995 c. 

4-8 X 10 -5 sec. 

(a) 670; ( b) 18-3. 

(a) x = 0-43 light years, r = 0-875 years; (b) x = 9 light 
years, r = 2-94 years; (c) x = 99 light years, t = 5-2 
years. 

(a) 18 light years, r = 11% years; (6) 1998 light years, 
t = 29,-* years. 

Ar/r = 1-35 X 10~ 14 . 

30-5 [x sec. 

A vjv = 8-4 x 10 -11 . 
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A TABLE OF PHYSICAL CONSTANTS 


This is a short list of those constants necessary to solve the problems set in the 
text. For a fuller list the reader is referred to E. R. Cohen, J. W. M. DuMond, 
T. W. Layton and J. S. Rollet Rev. Mod. Phys. 27 (1955) 363. 

Speed of light 

c = 2-997930 X 10 s metre . sec -1 

Electronic charge 

e = 1-60206 X 10-»» Coulomb 

Electron rest mass 

m = 9-1083 x 10 -81 kg = 0-510976 MeV/c 2 
Charge to mass ratio of the electron 

elm = 1-75890 X 10 11 Coulomb . kg- 1 

Proton rest mass 

M, = 1-67239 X 10 -27 kg = 938-211 MeV/e 2 

Neutron rest mass 

M n = 1-67470 X 10 -27 kg = 939-505 MeV/c 2 
Ratio proton mass to electron mass 

MJm - 1836-12 

Atomic mass unit ( 18 0 scale) 

1 a.m.u. = 1-6598 X 10 -27 kg 

Atomic mass unit ( 12 C scale) 

1 a.m.u. = 1-6603 X 10 -27 kg 

Avogadro’s number ( 16 0 scale) 

N = 6-02486 x 10 28 (kg mole)- 1 

Planck’s constant 

h = 6-62517 x 10 -84 Joule . sec 

Boltzmann constant 

k = 1-38044 X 10- 28 Joule . (“K)" 1 

Electric space constant 

e 0 = 8-85434 X 10~ 12 Farads . metre -1 

Magnetic space constant 

/u a = 4i7 X 10- 7 Henrys . metre -1 

Electron volt 

1 eV => 1-60206 X 10 -12 Joule 
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131, 158 

conservation of in elastic collisions, 
113-117, 146-152 
relativistic definition, 119-121 
threshold energy, 171, 172 
Energy (total), 122, 123, 126, 177 
conservation of, 146-153, 164-166, 
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relativistic definition, 118, 177 
relativistic transformations of, 145, 
146, 212, 213 

Foucault’s pendulum, 21, 23 
Four vectors, 205 
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using 4-vectors, 205-214 
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4-force, 211 
4-momentum, 210 
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time-like, 206 
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reference frame) 
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16, 25, 27, 29, 32, 33, 37, 95, 189, 
295 

acceleration transformations, 14 
geometrical representation, 188—190, 
199 

velocity transformations, 14, 27, 89 


Galileo, G., 4 

Gamma [y = (1 — r s /c s )—i] 
definition of, 43 
Gamma ray absorption, 
free nucleus, 313 
without recoil, 316-318 
Gamma ray emission, 
free nucleus, 312, 313 
without recoil, 316-318 
Gamma rays from light, 172-174 
Gamow, G., 157 
Gauge, 192-195 
Gauss’ law, 221 
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260-287 
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Gravitational field, 

bending of light in, 274—276 
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272-274 
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shift of spectral lines in, 267, 268 
Gravitational mass, 7, 263 
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258 

Inelastic collisions, 153-155, 169-172 
Inertial force, 18, 19, 260 
Inertial mass, 1, 6, 7, 117, 263, 274 
determination of, 7 

Inertial reference frame, 4, 22, 23, 36, 
260 

Interval between two events, 78-84, 
198, 207, 238 
definition of, 79 
space-like, 80 
time-like, 80 
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247 
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Janossy, L., 152 
Jiittner, F., 150 


Kaufmann, W., 136 
K-calculus, 319-334 
basic formulae, 323, 324 
Doppler effect, 333, 334 
Lorentz contraction, 328-330 
Lorentz transformations, 324—327 
simultaneity of events, 330-331 
time dilation, 327 
velocity transformations, 332 
Kelvin, Lord, 26, 30 
Kepler, J., 4 
Kilmister, G. W., 306 
Kilogram, primary standard of, 3 
Kinetic energy {see energy, kinetic) 
King, J. G., 138 
Kittel, C., 74 
Kjellman, J., 38, 168 
Klein, O., 179 
Knight, W. D., 74 
Kulikov, O. F., 172 


Lagrangian, relativistic, 135, 136 
Laser, 172-174 
Lebedew, P., 158 
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measurement of, 3, 11, 13, 48, 49, 
85, 328-330 

measurement of perpendicular to 
the direction of motion, 56-58 
moving rocket, 63-65, 328-330 
proper, 48 

Length contraction, 76, 77, 178, 

328-330 

illustration of using geometrical 
methods, 200, 201, 204, 205 
K-calculus, 328-330 
mathematical discussion of, 47-50 
qualitative discussion of, 54-59 
Leptons, conservation of, 166 
Lewis, G. M., 113 
Lieber, H. G., 47 
Lieber, L. R., 47 

Light, bending in a gravitational field, 
274-276 


Light velocity, 

constancy of in vacuo, 29, 33, 34, 
36-39, 42, 51, 91, 167-169, 
299-306, 311, 320, 326 
experimental checks of, 38, 167-169 
limitations of, 276 
limiting role of, 120 
moving medium, 94—96 
Limiting speed of particles, principle of 
constancy of, 39, 129-132, 306- 
308, 310 

application to photons, 169 
Linear electron accelerator, 129-131, 
142 

Loar, H. H., 73, 249 
Lodge, O. J., 294 

Longitudinal sound waves, speed of, 
180 

Lorentz, H. A., 61, 95 
Lorentz-FitzGerald contraction {see 
Length contraction) 

Lorentz force, 123, 135, 309 
Lorentz transformations, 33, 39-45, 58, 
310, 324-327 

co-ordinates and time, 42, 43, 
303-309, 324-326 

derivation of using the principle of 
the constancy of the limiting 
speed of particles, 43, 306—308 
derivation of using the principle of 
the constancy of the speed of 
light, 39-43, 303-306, 324-327 
geometrical representation using real 
variables, 191—201 

geometrical representation using the 
complex, X t = ict variable, 202- 
205 

K-calculus, 324—327 
representation of as a rotation of 
axes in the *i, X*. X t space, 

202-205 

successive transformations, 44, 45, 
90, 91 

Lustig, H., 316, 318 


McCrea, W. H., 104, 180, 239 
Mach, E., 277 
Mach’s principle, 277, 286 
McMillan, E. M., 139 
Magie, W. F., 293 

Magnetic field, motion of charge in, 
133, 134 
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